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REAGENTS AND METHODS USEFUL FOR DETECTING 
DISEASES OF THE BREAST 



Cross-Reference to Related Application 
10 This application is a continuation-in-part of U.S. application Serial No. 

08/879,354, filed June 20, 1997, from which priority is claimed pursuant to 35 
U.S.C. §120 and which is incorporated herein by reference in its entirety. 

Background of the Invention 
15 This invention relates generally to detecting diseases of the breast. 

Furthermore, the invention also relates to reagents and methods for detecting 
diseases of the breast. More particularly, the present invention relates to reagents 
such as polynucleotide sequences and the polypeptide sequences encoded thereby, 
as well as methods which utilize these sequences. The polynucleotide and 
20 polypeptide sequences are useful for detecting, diagnosing, staging, monitoring, 
prognosticating, preventing or treating, or determining predisposition to diseases 
or conditions of the breast, such as breast cancer. 

Breast cancer is the most common form of cancer occurring in females 
in the U.S. The incidence of breast cancers in the United States is projected to 
25 be 180,300 cases diagnosed and 43,900 breast cancer-related deaths to occur 
during 1998 (American Cancer Society statistics). Worldwide, the incidence 
of breast cancer increased from 700,000 in 1985 to about 900,000 in 1990. 
G.N. Hortobagyi et al., CA Cancer J Clin 45: 199-226 (1995). 

Procedures used for detecting, diagnosing, staging, monitoring, 
30 prognosticating, preventing or treating, or determining predisposition to 
diseases or conditions of the breast, such as breast cancer, are of critical 
importance to the outcome of the patient. For example, patients diagnosed 
with early breast cancer have greater than a 90% five-year relative survival rate 
as compared to a survival rate of about 20% for patients diagnosed with 
35 distantly metastasized breast cancers. (American Cancer Society statistics). 
Currently, the best initial indicators of early breast cancer are physical 
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examination of the breast and mammography. J.R. Harris et al. In: Cancer: 
Principles and Practice of Oncology . Fourth Edition, pp. 1264-1332, 
Philadelphia, PA: J/B. Lippincott Co. (1993). Mammography may detect a 
breast tumor before it can be detected by physical examination, but it has 
limitations. For example, mammography's predictive value depends on the 
observer's skill and the quality of the mammogram. In addition, 80 to 93% of 
suspicious mammograms are false positives, and 10 to 15% of women with 
breast cancer have false negative mammograms. C.J. Wright et al., Lancet 
346: 29-32 (1995). New diagnostic methods which are more sensitive and 
specific for detecting early breast cancer are clearly needed. 

Breast cancer patients are closely monitored following initial therapy 
and during adjuvant therapy to determine response to therapy, and to detect 
persistent or recurrent disease, or early distant metastasis. Current diagnostic 
procedures for monitoring breast cancer include mammography, bone scan, 
chest radiographs, liver function tests and tests for serum markers. The serum 
tumor markers most commonly used for monitoring patients are 
carcinoembryonic antigen (CEA) and CA 15-3. Limitations of CEA include 
absence of elevated serum levels in about 40% of women with metastatic 
disease. In addition, CEA elevation during adjuvant therapy may not be 
related to recurrence but to other factors that are not clinically important. CA 
15-3 can also be negative in a significant number of patients with progressive 
disease and, therefore, fail to predict metastasis. Both CEA and CA 15-3 can 
be elevated in nonmalignant, benign conditions giving rise to false positive 
results. Therefore, it would be clinically beneficial to find a breast associated 
marker which is more sensitive and specific in detecting cancer recurrence. J. 
R. Harris et al., supra. M. K. Schwartz, In: Cancer: Principles and Practice 
of Oncology. Vol. 1 , Fourth Edition, pp. 531 - 542, Philadelphia, PA: J/B. 

Lippincott Co. 1993. 

Another important step in managing breast cancer is to determine the 
stage of the patient's disease because stage determination has potential 
prognostic value and provides criteria for designing optimal therapy. 
Currently, pathological staging of breast cancer is preferable over clinical 
staging because the former gives a more accurate prognosis. J. R. Harris et 
al., supra . On the other hand, clinical staging would be preferred were it at 
least as accurate as pathological staging because it does not depend on an 
invasive procedure to obtain tissue for pathological evaluation. Staging of 
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breast cancer could be improved by detecting new markers in serum or urine 
which could differentiate between different stages of invasion. Such markers 
could be mRNA or protein markers expressed by cells originating from the 
primary tumor in the breast but residing in blood, bone marrow or lymph 
nodes and could serve as sensitive indicators for metastasis to these distal 
organs. For example, specific protein antigens and mRNA, associated with 
breast epithelial cells, have been detected by immunohistochemical techniques 
and RT-PCR, respectively, in bone marrow, lymph nodes and blood of breast 
cancer patients suggesting metastasis. K. Pantel et al, Onkologie 18: 394-401 
(1995). 

Such diagnostic procedures also could include immunological assays 
based upon the appearance of various disease markers in test samples such as 
blood, plasma, serum or urine obtained by minimally invasive procedures which 
are detectable by immunological methods. These diagnostic procedures would 
provide information to aid the physician in managing the patient with disease of 
the breast, at low cost to the patient. Markers such as prostate specific antigen 
(PSA) and human chorionic gonadotropin (hCG) exist and are used clinically for 
screening patients for prostate cancer and testicular cancer, respectively. For 
example, PSA normally is secreted by the prostate at high levels into the seminal 
fluid, but is present in very low levels in the blood of men with normal prostates. 
Elevated levels of PSA protein in serum are used in the early detection of prostate 
cancer or disease in asymptomatic men. See, for example, G.E. Hanks et al., In: 
Cancer: Principles and Practice of Oncology, Vol. 1, Fourth Edition, pp. 1073- 
1113, Philadelphia, PA: J.B. Lippincott Co. 1993. M. K. Schwartz et al., In: 
Cancer: Principles and Practice of Oncology, Vol. 1, Fourth Edition, pp. 531- 
542, Philadelphia, PA: J.B. Lippincott Co. 1993. Likewise, the management of 
breast diseases could be improved by the use of new markers normally expressed 
in the breast but found in elevated amounts in an inappropriate body compartment 
as a result of the disease of the breast. 

Further, new markers which could predict the biologic behavior of 
early breast cancers would also be of significant value. Early breast cancers 
that threaten or will threaten the life of the patient are more clinically important 
than those that do not or will not be a threat. G.E. Hanks, supra. Such 
markers are needed to predict which patients with histologically negative 
lymph nodes will experience recurrence of cancer and also to predict which 
cases of ductal carcinoma in situ will develop into invasive breast carcinoma. 
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More accurate prognostic markers would allow the clinician to accurately 
identify early cancers localized to the breast which will progress and 
metastasize if not treated aggressively. Additionally, the absence of a marker 
for an aggressive cancer in the patient could spare the patient expensive and 
non-beneficial treatment. J. R. Harris et al., supra. E. R. Frykberg et al., 

Cancer 74: 350-361 (1994). 

It therefore would be advantageous to provide specific methods and 
reagents useful for detecting, diagnosing, staging, monitoring, 
prognosticating, preventing or treating, or determining predisposition to 
diseases or conditions of the breast. Such methods would include assaying a 
test sample for products of a gene which are overexpressed in diseases and 
conditions associated with the breast, including cancer. Such methods may 
also include assaying a test sample for products of a gene which have been 
altered by the disease or condition associated with the breast, including cancer. 
Such methods may further include assaying a test sample for products of a 
gene whose distribution among the various tissues and compartments of the 
body have been altered by a breast-associated disease or condition, including 
cancer. Such methods would comprise making cDNA from mRNA in the test 
sample, amplifying, when necessary, portions of the cDNA corresponding to 
the gene or a fragment thereof, and detecting the cDNA product as an 
indication of the presence of the disease or condition including cancer or 
detecting translation products of the mRNAs comprising gene sequences as an 
indication of the presence of the disease. Useful reagents include 
polynucleotide(s), or fragment(s) thereof which may be used in diagnostic 
methods such as reverse transcriptase-polymerase chain reaction (RT-PCR), 
PCR, or hybridization assays of mRNA extracted from biopsied tissue, blood 
or other test samples; or proteins which are the translation products of such 
mRNAs; or antibodies directed against these proteins. Such assays would 
include methods for assaying a sample for product(s) of the gene and detecting 
the product(s) as an indication of disease of the breast. Drug treatment or gene 
therapy for diseases and conditions of the breast including cancer can be based 
on these identified gene sequences or their expressed proteins, and efficacy of 
any particular therapy can be monitored. Furthermore, it would be 
advantageous to have available alternative, non-surgical diagnostic methods 
capable of detecting early stage breast disease, such as cancer. 
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Summarv of the Invention 

The present invention provides a method of detecting a target BS124 
polynucleotide in a test sample. The method comprises contacting the test sample 
with at least one BS 124-specific polynucleotide and detecting the presence of the 
target BS 124 polynucleotide in the test sample. The BS 124-specific 
polynucleotide has at least 50% identity with a polynucleotide selected from the 
group consisting of SEQUENCE ID NO 1, SEQUENCE ID NO 2, SEQUENCE 
ID NO 3, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and fragments or 
complements thereof. Also, the BS 124-specific polynucleotide may be attached to 
a solid phase prior to performing the method. 

The present invention also provides a method for detecting BS124 mRNA 
in a test sample, which comprises performing reverse transcription (RT) with at 
least one primer in order to produce cDNA, amplifying the cDNA so obtained 
using BS124 oligonucleotides as sense and antisense primers to obtain BS124 
amplicon, and detecting the presence of the BS124 amplicon as an indication of 
the presence of BS124 mRNA in the test sample, wherein the BS124 
oligonucleotides have at least 50% identity with a sequence selected from the 
group consisting of SEQUENCE ID NO 1, SEQUENCE ID NO 2, SEQUENCE 
ID NO 3, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and fragments or 
complements thereof. Amplification can be performed by the polymerase chain 
reaction. Also, the test sample can be reacted with a solid phase prior to 
performing the method, prior to amplification or prior to detection. This reaction 
can be a direct or an indirect reaction. Further, the detection step can comprise 
utilizing a detectable label capable of generating a measurable signal. The 
detectable label can be attached to a solid phase. 

The present invention further provides a method of detecting a target 
BS124 polynucleotide in a test sample suspected of containing target BS124 
polynucleotides. The method comprises (a) contacting the test sample with at least 
one BS124 oligonucleotide as a sense primer and at least one BS124 
oligonucleotide as an anti-sense primer, and amplifying same to obtain a first stage 
reaction product; (b) contacting the first stage reaction product with at least one 
other BS124 oligonucleotide to obtain a second stage reaction product, with the 
proviso that the other BS124 oligonucleotide is located 3' to the BS124 
oligonucleotides utilized in step (a) and is complementary to the first stage reaction 
product; and (c) detecting the second stage reaction product as an indication of the 
presence of a target BS 124 polynucleotide in the test sample. The BS 124 
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oligonucleotides selected as reagents in the method have at least 50% identity with 
a sequence selected from the group consisting of SEQUENCE ID NO 1, 
SEQUENCE ID NO 2, SEQUENCE ID NO 3, SEQUENCE ID NO 4, 
SEQUENCE ID NO 5, and fragments or complements thereof. Amplification 
may be performed by the polymerase chain reaction. The test sample can be 
reacted either directly or indirectly with a solid phase prior to performing the 
method, or prior to amplification, or prior to detection. The detection step also 
comprises utilizing a detectable label capable of generating a measurable signal; 
further, the detectable label can be attached to a solid phase. Test kits useful for 
detecting target BS124 polynucleotides in a test sample are also provided which 
comprise a container containing at least one BS 1 24-specific polynucleotide 
selected from the group consisting of SEQUENCE ID NO 1, SEQUENCE ID NO 
2, SEQUENCE ID NO 3, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and 
fragments or complements thereof. These test kits further comprise containers 
with tools useful for collecting test samples (such as, for example, blood, urine, 
saliva and stool). Such tools include lancets and absorbent paper or cloth for 
collecting and stabilizing blood; swabs for collecting and stabilizing saliva; and 
cups for collecting and stabilizing urine or stool samples. Collection materials 
such as papers, cloths, swabs, cups, and the like, may optionally be treated to 
avoid denaturation or irreversible adsorption of the sample. The collection 
materials also may be treated with or contain preservatives, stabilizers or 
antimicrobial agents to help maintain the integrity of the specimens. 

The present invention also provides a purified polynucleotide or fragment 
thereof derived from a BS 124 gene. The purified polynucleotide is capable of 
selectively hybridizing to the nucleic acid of the BS 124 gene, or a complement 
thereof. The polynucleotide has at least 50% identity with a polynucleotide 
selected from the group consisting of (a) SEQUENCE ID NO 1 , SEQUENCE ID 
NO 2, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and complements thereof, 
and (b) fragments of SEQUENCE ID NO 1, SEQUENCE ID NO 2, and 
SEQUENCE ID NO 3. Further, the purified polynucleotide can be produced by 
recombinant and/or synthetic techniques. The purified recombinant polynucleotide 
can be contained within a recombinant vector. The invention further comprises a 
host cell transfected with the recombinant vector. 

The present invention further provides a recombinant expression system 
comprising a nucleic acid sequence that includes an open reading frame derived 
from BS124. The nucleic acid sequence has at least 50% identity with a sequence 
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selected from the group consisting of SEQUENCE ID NO 1 , SEQUENCE ID NO 
2, SEQUENCE ID NO 3, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and 
fragments or complements thereof. The nucleic acid sequence is operably linked 
to a control sequence compatible with a desired host. Also provided is a cell 
transfected with this recombinant expression system. 

The present invention also provides a polypeptide encoded by BS124. 
The polypeptide can be produced by recombinant technology, provided in purified 
form, or produced by synthetic techniques. The polypeptide comprises an amino 
acid sequence which has at least 50% identity with an amino acid sequence 
selected from the group consisting of SEQUENCE ID NO 22, SEQUENCE ID 
NO 23, SEQUENCE ID NO 24, SEQUENCE ID NO 25, and fragments thereof. 

Also provided is an antibody which specifically binds to at least one 
BS124 epitope. The antibody can be a polyclonal or monoclonal antibody. The 
epitope is derived from an amino acid sequence selected from the group consisting 
of SEQUENCE ID NO 22, SEQUENCE ID NO 23, SEQUENCE ID NO 24, 
SEQUENCE ID NO 25, and fragments thereof. Assay kits for determining the 
presence of BS 124 antigen or anti-BS 124 antibody in a test sample are also 
included. In one embodiment, the assay kits comprise a container containing at 
least one BS124 polypeptide having at least 50% identity with an amino acid 
sequence selected from the group consisting of SEQUENCE ID NO 22, 
SEQUENCE ID NO 23, SEQUENCE ID NO 24, SEQUENCE ID NO 25, and 
fragments thereof. Further, the test kit can comprise a container with tools useful 
for collecting test samples (such as blood, urine, saliva, and stool). Such tools 
include lancets and absorbent paper or cloth for collecting and stabilizing blood; 
swabs for collecting and stabilizing saliva; and cups for collecting and stabilizing 
urine or stool samples. Collection materials such as papers, cloths, swabs, cups, 
and the like, may optionally be treated to avoid denaturation or irreversible 
adsorption of the sample. These collection materials also may be treated with or 
contain preservatives, stabilizers or antimicrobial agents to help maintain the 
integrity of the specimens. Also, the polypeptide can be attached to a solid phase. 

In another embodiment of the invention, antibodies or fragments thereof 
against the BS 124 antigen can be used to detect or image localization of the antigen 
in a patient for the purpose of detecting or diagnosing a disease or condition. 
Such antibodies can be polyclonal or monoclonal, or made by molecular biology 
techniques, and can be labeled with a variety of detectable labels, including but not 
limited to radioisotopes and paramagnetic metals. Furthermore, antibodies or 
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fragments thereof, whether monoclonal, polyclonal, or made by molecular biology 
techniques, can be used as therapeutic agents for the treatment of diseases 
characterized by expression of the BS124 antigen. In the case of therapeutic 
applications, the antibody may be used without derivitization, or it may be 
derivitized with a cytotoxic agent such as a radioisotope, enzyme, toxin, drug, 

prodrug, or the like. 

Another assay kit for determining the presence of BS 124 antigen or anti- 
BS124 antibody in a test sample comprises a container containing an antibody 
which specifically binds to a BS124 antigen, wherein the BS124 antigen 
comprises at least one BS124-encoded epitope. The BS124 antigen has at least 
about 60% sequence similarity to a sequence of a BS124-encoded antigen selected 
from the group consisting of SEQUENCE ID NO 22, SEQUENCE ID NO 23, 
SEQUENCE ID NO 24, SEQUENCE ID NO 25, and fragments thereof. These 
test kits can further comprise containers with tools useful for collecting test 
samples (such as blood, urine, saliva, and stool). Such tools include lancets and 
absorbent paper or cloth for collecting and stabilizing blood; swabs for collecting 
and stabilizing saliva; cups for collecting and stabilizing urine or stool samples. 
Collection materials such as papers, cloths, swabs, cups and the like, may 
optionally be treated to avoid denaturation or irreversible adsorption of the sample. 
These collection materials also may be treated with, or contain, preservatives, 
stabilizers or antimicrobial agents to help maintain the integrity of the specimens. 
The antibody can be attached to a solid phase. 

A method for producing a polypeptide which contains at least one epitope 
of BS124 is provided, which method comprises incubating host cells transfected 
with an expression vector. This vector comprises a polynucleotide sequence 
encoding a polypeptide, wherein the polypeptide comprises an amino acid 
sequence having at least 50% identity with a BS124 amino acid sequence selected 
from the group consisting of SEQUENCE ID NO 22, SEQUENCE ID NO 23, 
SEQUENCE ID NO 24, SEQUENCE ID NO 25, and fragments thereof. 

A method for detecting BS 124 antigen in a test sample suspected of 
containing BS124 antigen also is provided. The method comprises contacting the 
test sample with an antibody or fragment thereof which specifically binds to at 
least one epitope of BS124 antigen, for a time and under conditions sufficient for 
the formation of antibody/antigen complexes; and detecting the presence of such 
complexes containing the antibody as an indication of the presence of BS124 
antigen in the test sample. The antibody can be attached to a solid phase and may 
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be either a monoclonal or polyclonal antibody. Furthermore, the antibody 
specifically binds to at least one BS 124 antigen selected from the group consisting 
of SEQUENCE ID NO 22, SEQUENCE ID NO 23, SEQUENCE ID NO 24, 
SEQUENCE ID NO 25, and fragments thereof. 
5 Another method is provided which detects antibodies which specifically 

bind to BS124 antigen in a test sample suspected of containing these antibodies. 
The method comprises contacting the test sample with a polypeptide which 
contains at least one BS124 epitope, wherein the BS124 epitope comprises an 
amino acid sequence having at least 50% identity with an amino acid sequence 

10 encoded by a BS 124 polynucleotide, or a fragment thereof. Contacting is carried 
out for a time and under conditions sufficient to allow antigen/antibody complexes 
to form. The method further entails detecting complexes which contain the 
polypeptide. The polypeptide can be attached to a solid phase. Further, the 
polypeptide can be a recombinant protein or a synthetic peptide having at least 

1 5 50% identity with an amino acid sequence selected from the group consisting of 
SEQUENCE ID NO 22, SEQUENCE ID NO 23, SEQUENCE ID NO 24, 
SEQUENCE ID NO 25, and fragments thereof. 

The present invention provides a cell transfected with a BS 124 nucleic acid 
sequence that encodes at least one epitope of a BS124 antigen, or fragment 

20 thereof. The nucleic acid sequence is selected from the group consisting of 
SEQUENCE ID NO 1, SEQUENCE ID NO 2, SEQUENCE ID NO 3, 
SEQUENCE ID NO 4, SEQUENCE ID NO 5, and fragments or complements 
thereof. 

A method for producing antibodies to BS124 antigen also is provided, 
25 which method comprises administering to an individual an isolated immunogenic 
polypeptide or fragment thereof, wherein the isolated immunogenic polypeptide 
comprises at least one BS124 epitope. The immunogenic polypeptide is 
administered in an amount sufficient to produce an immune response. The 
isolated, immunogenic polypeptide comprises an amino acid sequence selected 
30 from the group consisting of SEQUENCE ID NO 22, SEQUENCE ID NO 23, 
SEQUENCE ID NO 24, SEQUENCE ID NO 25, and fragments thereof. 

Another method for producing antibodies which specifically bind to 
BS124 antigen is disclosed, which method comprises administering to an 
individual a plasmid comprising a nucleic acid sequence which encodes at least 
35 one BS124 epitope derived from an amino acid sequence selected from the group 
consisting of SEQUENCE ID NO 22, SEQUENCE ID NO 23, SEQUENCE ID 
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NO 24, SEQUENCE ID NO 25, and fragments thereof. The plasmid is 
administered in an amount such that the plasmid is taken up by cells in the 
individual and expressed at levels sufficient to produce an immune response. 
Also provided is a composition of matter that comprises a 

5 BS124 polynucleotide of at least about 10-12 nucleotides having at least 50% 
identity with a polynucleotide selected from the group consisting of (a) 
SEQUENCE ID NO 1 , SEQUENCE ID NO 2, SEQUENCE ID NO 4, 
SEQUENCE ID NO 5, and complements thereof, and (b) fragments of 
SEQUENCE ID NO 1, SEQUENCE ID NO 2, and SEQUENCE ID NO 3. The 

10 BS 1 24 polynucleotide encodes an amino acid sequence having at least one BS 1 24 
epitope. Another composition of matter provided by the present invention 
comprises a polypeptide with at least one BS124 epitope of about 8-10 amino 
acids. The polypeptide comprises an amino acid sequence having at least 50% 
identity with an amino acid sequence selected from the group consisting of 

15 SEQUENCE ID NO 22, SEQUENCE ID NO 23, SEQUENCE ID NO 24, 
SEQUENCE ID NO 25, and fragments thereof. Also provided is a gene, or a 
fragment thereof, coding for a BS 124 polypeptide which has at least 50% identity 
to SEQUENCE ID NO 22; and a gene, or a fragment thereof, comprising DNA 
having at least 50% identity with SEQUENCE ID NO 4 or SEQUENCE ID NO 5. 

20 

Brief Description of the Drawings 

Figures 1A-1B show the nucleotide alignment of clones 1730294 

(SEQUENCE ID NO 1), 1213903 (SEQUENCE ID NO 2), g2185139 

(SEQUENCE ID NO 3), the full-length sequence of clone 1730291 (designated as 
25 clone 1730294IH (SEQUENCE ID NO 4)), and the consensus sequence 

(SEQUENCE ID NO 5) derived therefrom. 

Figure 2 shows the contig map depicting the formation of the consensus 

nucleotide sequence (SEQUENCE ID NO 5) from the nucleotide alignment of 

overlapping clones 1730294 (SEQUENCE ID NO 1), 1213903 (SEQUENCE ID 
30 NO 2), g2 1 85 1 39 (SEQUENCE ID NO 3), and 1730294IH (SEQUENCE ID NO 

4). 

Figure 3 A is a scan of an ethidium bromide-stained agarose gel of RNA 
from various tissue extracts, and a corresponding Northern blot of the RNA using 
a radiolabeled BS124-specific probe. Figure 3B is a scan of an ethidium bromide- 
35 stained agarose gel of RNA from various breast tissue extracts, and a 
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corresponding Northern blot of the RNA using a radiolabeled BS124-specific 
probe. 

Figure 4A is a scan of a SYBR® Green-stained agarose gel of BS 124 RNA- 
specific RT-PCR amplification products from normal breast and cancerous breast 
5 tissue RNAs. Figure 4B is a scan of a SYBR® Green-stained agarose gel of BS 124 
RNA-specific RT-PCR amplification products from RNAs obtained from normal or 
cancerous placenta, lung, and colon tissues. 

Figure 5 A is a scan of a SYBR® Green-stained agarose gel of BS124 RNA- 
specific OH-PCR amplification products from a BS 124-specific transcript and T-47D 
10 breast cancer cells. Figure 5B depicts LCx Analyzer results from BS 124 RNA- 
specific OH-PCR amplification products from a diluted BS 124-specific transcript and 
T-47D breast cancer cells. 

Figures 6A-6B show the results of a Western blot performed on a panel of 
tissue protein extracts probed with antiserum against a BS124 synthetic peptide (the 
15 BS 124 peptide of SEQUENCE ID NO 23). 

Detailed Description of the Invention 

The present invention provides a gene, or a fragment thereof, which codes 
for a BS124 polypeptide having at least about 50% identity to SEQUENCE ED NO 

20 22. The present invention further encompasses a BS124 gene, or a fragment 

thereof, comprising DNA which has at least about 50% identity with SEQUENCE 
ID NO 4 or SEQUENCE ID NO 5. 

The present invention also provides methods for assaying a test sample for 
products of a breast tissue gene designated as BS124, which methods comprise 

25 making cDNA from mRNA in the test sample, and detecting the cDNA as an 

indication of the presence of breast tissue gene BS124. The methods may include 
an amplification step, wherein one or more portions of the mRNA from BS124 
corresponding to the gene or fragments thereof, is amplified. Methods also are 
provided for assaying for the translation products of BS124. Test samples which 

30 may be assayed by the methods provided herein include tissues, cells, body fluids 
and secretions. The present invention also provides reagents such as 
oligonucleotide primers and polypeptides which are useful in performing these 
methods. 

Portions of the nucleic acid sequences disclosed herein are useful as 
35 primers for the reverse transcription of RNA or for the amplification of cDNA; or 
as probes to determine the presence of certain mRNA sequences in test samples. 
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Also disclosed are nucleic acid sequences which permit the production of encoded 
polypeptide sequences which are useful as standards or reagents in diagnostic 
immunoassays, as targets for pharmaceutical screening assays and/or as 
components or as target sites for various therapies. Monoclonal and polyclonal 
5 antibodies directed against at least one epitope contained within these polypeptide 
sequences are useful as delivery agents for therapeutic agents as well as for 
diagnostic tests and for screening for diseases or conditions associated with 
BS 124, especially breast cancer. Isolation of sequences of other portions of the 
gene of interest can be accomplished utilizing probes or PCR primers derived from 
10 these nucleic acid sequences. This allows additional probes of the mRNA or 

cDNA of interest to be established, as well as corresponding encoded polypeptide 
sequences. These additional molecules are useful in detecting, diagnosing, 
staging, monitoring, prognosticating, preventing or treating, or determining the 
predisposition to diseases and conditions of the breast, such as breast cancer, 
1 5 characterized by BS 1 24, as disclosed herein. 

Techniques for determining amino acid sequence "similarity" are well- 
known in the art. In general, "similarity" means the exact amino acid to amino 
acid comparison of two or more polypeptides at the appropriate place, where 
amino acids are identical or possess similar chemical and/or physical properties 
20 such as charge or hydrophobicity. A so-termed "percent similarity" then can be 
determined between the compared polypeptide sequences. Techniques for 
determining nucleic acid and amino acid sequence identity also are well known in 
the art and include determining the nucleotide sequence of the mRNA for that gene 
(usually via a cDNA intermediate) and determining the amino acid sequence 
25 encoded thereby, and comparing this to a second amino acid sequence. In 

general, "identity" refers to an exact nucleotide to nucleotide or amino acid to 
amino acid correspondence of two polynucleotides or polypeptide sequences, 
respectively. Two or more polynucleotide sequences can be compared by 
determining their "percent identity." Two or more amino acid sequences likewise 
30 can be compared by determining their "percent identity." The programs available 
in the Wisconsin Sequence Analysis Package, Version 8 (available from Genetics 
Computer Group, Madison, WI), for example, the GAP program, are capable of 
calculating both the identity between two polynucleotides and the identity and 
similarity between two polypeptide sequences, respectively. Other programs for 
35 calculating identity or similarity between sequences are known in the art. 
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The compositions and methods described herein will enable the 
identification of certain markers as indicative of a breast tissue disease or 
condition; the information obtained therefrom will aid in the detecting, diagnosing, 
staging, monitoring, prognosticating, preventing or treating, or determining 

5 diseases or conditions associated with BS 1 24, especially breast cancer. Test 
methods include, for example, probe assays which utilize the sequence(s) 
provided herein and which also may utilize nucleic acid amplification methods 
such as the polymerase chain reaction (PCR), the ligase chain reaction (LCR), and 
hybridization. In addition, the nucleotide sequences provided herein contain open 

10 reading frames from which an immunogenic epitope may be found. This epitope 
is believed to be unique to the disease state or condition associated with BS124. It 
also is thought that the polynucleotides or polypeptides and protein encoded by the 
BS124 gene are useful as a marker. This marker is either elevated in disease such 
as breast cancer, altered in disease such as breast cancer, or present as a normal 

15 protein but appearing in an inappropriate body compartment. The uniqueness of 
the epitope may be determined by (i) its immunological reactivity and specificity 
with antibodies directed against proteins and polypeptides encoded by the BS124 
gene, and (ii) its nonreactivity with any other tissue markers. Methods for 
determining immunological reactivity are well-known and include, but are not 

20 limited to, for example, radioimmunoassay (RIA), enzyme-linked 

immunoabsorbent assay (ELISA), hemagglutination (HA), fluorescence 
polarization immunoassay (FPIA), chemiluminescent immunoassay (CLIA) and 
others. Several examples of suitable methods are described herein. 

Unless otherwise stated, the following terms shall have the following 

25 meanings: 

A polynucleotide "derived from" or "specific for" a designated sequence 
refers to a polynucleotide sequence which comprises a contiguous sequence of 
approximately at least about 6 nucleotides, preferably at least about 8 nucleotides, 
more preferably at least about 10-12 nucleotides, and even more preferably at least 

30 about 15-20 nucleotides corresponding, i.e., identical or complementary to, a 
region of the designated nucleotide sequence. The sequence may be 
complementary or identical to a sequence which is unique to a particular 
polynucleotide sequence as determined by techniques known in the art. 
Comparisons to sequences in databanks, for example, can be used as a method to 

35 determine the uniqueness of a designated sequence. Regions from which 
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sequences may be derived, include but are not limited to, regions encoding 
specific epitopes, as well as non-translated and/or non-transcribed regions. 

The derived polynucleotide will not necessarily be derived physically from 
the nucleotide sequence of interest under study, but may be generated in any 
manner, including, but not limited to, chemical synthesis, replication, reverse 
transcription or transcription, which is based on the information provided by the 
sequence of bases in the region(s) from which the polynucleotide is derived. As 
such, it may represent either a sense or an antisense orientation of the original 
polynucleotide. In addition, combinations of regions corresponding to that of the 
designated sequence may be modified in ways known in the art to be consistent 
with the intended use. 

A "fragment" of a specified polynucleotide refers to a polynucleotide 
sequence which comprises a contiguous sequence of approximately at least about 
6 nucleotides, preferably at least about 8 nucleotides, more preferably at least 
about 10-12 nucleotides, and even more preferably at least about 15-20 
nucleotides corresponding, i.e., identical or complementary to, a region of the 
specified nucleotide sequence. 

The term "primer" denotes a specific oligonucleotide sequence which is 
complementary to a target nucleotide sequence and used to hybridize to the target 
nucleotide sequence. A primer serves as an initiation point for nucleotide 
polymerization catalyzed by either DNA polymerase, RNA polymerase or reverse 
transcriptase. 

The term "probe" denotes a defined nucleic acid segment (or nucleotide 
analog segment, e.g., PNA as defined hereinbelow) which can be used to identify 
a specific polynucleotide present in samples bearing the complementary sequence. 

"Encoded by" refers to a nucleic acid sequence which codes for a 
polypeptide sequence, wherein the polypeptide sequence or a portion thereof 
contains an amino acid sequence of at least 3 to 5 amino acids, more preferably at 
least 8 to 10 amino acids, and even more preferably at least 15 to 20 amino acids 
from a polypeptide encoded by the nucleic acid sequence. Also encompassed are 
polypeptide sequences which are immunologically identifiable with a polypeptide 
encoded by the sequence. Thus, a "polypeptide," "protein," or "amino acid" 
sequence has at least about 50% identity, preferably about 60% identity, more 
preferably about 75-85% identity, and most preferably about 90-95% or more 
identity to a BS124 amino acid sequence. Further, the BS124 "polypeptide," 
"protein," or "amino acid" sequence may have at least about 60% similarity, 
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preferably at least about 75% similarity, more preferably about 85% similarity, 
and most preferably about 95% or more similarity to a polypeptide or amino acid 
sequence of BS 124. This amino acid sequence can be selected from the group 
consisting of SEQUENCE ID NO 22, SEQUENCE ID NO 23, SEQUENCE ID 
NO 24, SEQUENCE ID NO 25, and fragments thereof. 

A "recombinant polypeptide," "recombinant protein," or "a polypeptide 
produced by recombinant techniques," which terms may be used interchangeably 
herein, describes a polypeptide which by virtue of its origin or manipulation is not 
associated with all or a portion of the polypeptide with which it is associated in 
nature and/or is linked to a polypeptide other than that to which it is linked in 
nature. A recombinant or encoded polypeptide or protein is not necessarily 
translated from a designated nucleic acid sequence. It also may be generated in 
any manner, including chemical synthesis or expression of a recombinant 

expression system. 

The term "synthetic peptide" as used herein means a polymeric form of 
amino acids of any length, which may be chemically synthesized by methods well- 
known to the routineer. These synthetic peptides are useful in various 
applications. 

The term "polynucleotide" as used herein means a polymeric form of 
nucleotides of any length, either ribonucleotides or deoxyribonucleotides. This 
term refers only to the primary structure of the molecule. Thus, the term includes 
double- and single-stranded DNA, as well as double- and single-stranded RNA. 
It also includes modifications, such as methylation or capping and unmodified 
forms of the polynucleotide. The terms "polynucleotide," "oligomer," 
"oligonucleotide," and "oligo" are used interchangeably herein. ^ 

"A sequence corresponding to a cDNA" means that the sequence contains a 
polynucleotide sequence that is identical or complementary to a sequence in the 
designated DNA. The degree (or "percent") of identity or complementarity to the 
cDNA will be approximately 50% or greater, preferably at least about 70% or 
greater, and more preferably at least about 90% or greater. The sequence that 
corresponds to the identified cDNA will be at least about 50 nucleotides in length, 
preferably at least about 60 nucleotides in length, and more preferably at least 
about 70 nucleotides in length. The correspondence between the gene or gene 
fragment of interest and the cDNA can be determined by methods known in the art 
and include, for example, a direct comparison of the sequenced material with the 
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cDNAs described, or hybridization and digestion with single strand nucleases, 
followed by size determination of the digested fragments. 

"Purified polynucleotide" refers to a polynucleotide of interest or fragment 
thereof which is essentially free, e.g., contains less than about 50%, preferably 
less than about 70%, and more preferably less than about 90%, of the protein with 
which the polynucleotide is naturally associated. Techniques for purifying 
polynucleotides of interest are well-known in the art and include, for example, 
disruption of the cell containing the polynucleotide with a chaotropic agent and 
separation of the polynucleotide(s) and proteins by ion-exchange chromatography, 
affinity chromatography and sedimentation according to density. 

"Purified polypeptide" or "purified protein" means a polypeptide of interest 
or fragment thereof which is essentially free of, e.g., contains less than about 
50%, preferably less than about 70%, and more preferably less than about 90%, 
cellular components with which the polypeptide of interest is naturally associated. 
Methods for purifying polypeptides of interest are known in the art. 

The term "isolated" means that the material is removed from its original 
environment (e.g., the natural environment if it is naturally occurring). For 
example, a naturally-occurring polynucleotide or polypeptide present in a living 
animal is not isolated, but the same polynucleotide or DNA or polypeptide, which 
is separated from some or all of the coexisting materials in the natural system, is 
isolated. Such polynucleotide could be part of a vector and/or such polynucleotide 
or polypeptide could be part of a composition, and still be isolated in that the 
vector or composition is not part of its natural environment. 

"Polypeptide" and "protein" are used interchangeably herein and indicate at 
least one molecular chain of amino acids linked through covalent and/or non- 
covalent bonds. The terms do not refer to a specific length of the product. Thus 
peptides, oligopeptides and proteins are included within the definition of 
polypeptide. The terms include post-translational modifications of the 
polypeptide, for example, glycosylations, acetylations, phosphorylations and the 
like. In addition, protein fragments, analogs, mutated or variant proteins, fusion 
proteins and the like are included within the meaning of polypeptide. 

A "fragment" of a specified polypeptide refers to an amino acid sequence 
which comprises at least about 3-5 amino acids, more preferably at least about 8- 
10 amino acids, and even more preferably at least about 15-20 amino acids derived 
from the specified polypeptide. 
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"Recombinant host cells," "host cells," "cells," "cell lines," "cell cultures," 
and other such terms denoting microorganisms or higher eukaryotic cell lines 
cultured as unicellular entities refer to cells which can be, or have been, used as 
recipients for recombinant vector or other transferred DNA, and include the 
original progeny of the original cell which has been transfected. 

As used herein "replicon" means any genetic element, such as a plasmid, a 
chromosome or a virus, that behaves as an autonomous unit of polynucleotide 
replication within a cell. 

A "vector" is a replicon in which another polynucleotide segment is 
attached, such as to bring about the replication and/or expression of the attached 
segment. 

The term "control sequence" refers to a polynucleotide sequence which is 
necessary to effect the expression of a coding sequence to which it is ligated. The 
nature of such control sequences differs depending upon the host organism. In 
prokaryotes, such control sequences generally include a promoter, a ribosomal 
binding site and terminators; in eukaryotes, such control sequences generally 
include promoters, terminators and, in some instances, enhancers. The term 
"control sequence" thus is intended to include at a minimum all components whose 
presence is necessary for expression, and also may include additional components 
whose presence is advantageous, for example, leader sequences. 

"Operably linked" refers to a situation wherein the components described 
are in a relationship permitting them to function in their intended manner. Thus, 
for example, a control sequence "operably linked" to a coding sequence is ligated 
in such a manner that expression of the coding sequence is achieved under 
conditions compatible with the control sequence. 

The term "open reading frame" or "ORF" refers to a region of a 
polynucleotide sequence which encodes a polypeptide. This region may represent 
a portion of a coding sequence or a total coding sequence. 

A "coding sequence" is a polynucleotide sequence which is transcribed 
into mRNA and translated into a polypeptide when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are 
determined by a translation start codon at the 5' -terminus and a translation stop 
codon at the 3' -terminus. A coding sequence can include, but is not limited to, 
mRNA, cDNA and recombinant polynucleotide sequences. 

The term "immunologically identifiable with/as" refers to the presence of 
epitope(s) and polypeptide(s) which also are present in and are unique to the 
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designated polypeptide(s). Immunological identity may be determined by 
antibody binding and/or competition in binding. These techniques are known to 
the routineer and also are described herein. The uniqueness of an epitope also can 
be determined by computer searches of known data banks, such as GenBank, for 
the polynucleotide sequence which encodes the epitope and by amino acid 
sequence comparisons with other known proteins. 

As used herein, "epitope" means an antigenic determinant of a polypeptide 
or protein. Conceivably, an epitope can comprise three amino acids in a spatial 
conformation which is unique to the epitope. Generally, an epitope consists of at 
least five such amino acids and more usually, it consists of at least eight to ten 
amino acids. Methods of examining spatial conformation are known in the art and 
include, for example, x-ray crystallography and two-dimensional nuclear magnetic 
resonance. 

A "conformational epitope" is an epitope that is comprised of a specific 
juxtaposition of amino acids in an immunologically recognizable structure, such 
amino acids being present on the same polypeptide in a contiguous or non- 
contiguous order or present on different polypeptides. 

A polypeptide is "immunologically reactive" with an antibody when it 
binds to an antibody due to antibody recognition of a specific epitope contained 
within the polypeptide. Immunological reactivity may be determined by antibody 
binding, more particularly, by the kinetics of antibody binding, and/or by 
competition in binding using as competitor(s) a known polypeptide(s) containing 
an epitope against which the antibody is directed. The methods for determining 
whether a polypeptide is immunologically reactive with an antibody are known in 
the art. 

As used herein, the term "immunogenic polypeptide containing an epitope 
of interest" means naturally occurring polypeptides of interest or fragments 
thereof, as well as polypeptides prepared by other means, for example, by 
chemical synthesis or the expression of the polypeptide in a recombinant 
organism. 

The term "transfection" refers to the introduction of an exogenous 
polynucleotide into a prokaryotic or eucaryotic host cell, irrespective of the method 
used for the introduction. The term "transfection" refers to both stable and 
transient introduction of the polynucleotide, and encompasses direct uptake of 
polynucleotides, transformation, transduction, and f-mating. Once introduced into 
the host cell, the exogenous polynucleotide may be maintained as a non-integrated 



Atty Dkt No. 6120.US.P1 
PATENT 



-19- 

replicon, for example, a plasmid, or alternatively, may be integrated into the host 
genome. 

"Treatment" refers to prophylaxis and/or therapy. 

The term "individual" as used herein refers to vertebrates, particularly 
members of the mammalian species and includes, but is not limited to, domestic 
animals, sports animals, primates and humans; more particularly, the term refers 
to humans. 

The term "sense strand" or "plus strand" (or "+") as used herein denotes a 
nucleic acid that contains the sequence that encodes the polypeptide. The term 
"antisense strand" or "minus strand" (or "-") denotes a nucleic acid that contains a 
sequence that is complementary to that of the "plus" strand. 

The term "test sample" refers to a component of an individual's body 
which is the source of the analyte (such as antibodies of interest or antigens of 
interest). These components are well known in the art. A test sample is typically 
anything suspected of containing a target sequence. Test samples can be prepared 
using methodologies well known in the art such as by obtaining a specimen from 
an individual and, if necessary, disrupting any cells contained thereby to release 
target nucleic acids. These test samples include biological samples which can be 
tested by the methods of the present invention described herein and include human 
and animal body fluids such as whole blood, serum, plasma, cerebrospinal fluid, 
sputum, bronchial washing, bronchial aspirates, urine, lymph fluids, and various 
external secretions of the respiratory, intestinal and genitourinary tracts, tears, 
saliva, milk, white blood cells, myelomas and the like; biological fluids such as 
cell culture supematants; tissue specimens which may be fixed; and cell specimens 

which may be fixed. 

"Purified product" refers to a preparation of the product which has been 
isolated from the cellular constituents with which the product is normally 
associated and from other types of cells which may be present in the sample of 
interest. 

"PNA" denotes a "peptide nucleic acid analog" which may be utilized in a 
procedure such as an assay described herein to determine the presence of a target. 
"MA" denotes a "morpholino analog" which may be utilized in a procedure such 
as an assay described herein to determine the presence of a target. See, for 
example, U.S. Patent No. 5,378,841, which is incorporated herein by reference. 
PNAs are neutrally charged moieties which can be directed against RNA targets or 
DNA. PNA probes used in assays in place of, for example, the DNA probes of 



Atty Dkt No. 6120.US.P1 
PATENT 



-20- 

the present invention, offer advantages not achievable when DNA probes are 
used. These advantages include manufacturability, large scale labeling, 
reproducibility, stability, insensitivity to changes in ionic strength and resistance to 
enzymatic degradation which is present in methods utilizing DNA or RNA. These 
PNAs can be labeled with ("attached to") such signal generating compounds as 
fluorescein, radionucleotides, chemiluminescent compounds and the like. PNAs 
or other nucleic acid analogs such as MAs thus can be used in assay methods in 
place of DNA or RNA. Although assays are described herein utilizing DNA 
probes, it is within the scope of the routineer that PNAs or MAs can be substituted 
for RNA or DNA with appropriate changes if and as needed in assay reagents. 

"Analyte," as used herein, is the substance to be detected which may be 
present in the test sample. The analyte can be any substance for which there exists 
a naturally occurring specific binding member (such as an antibody), or for which 
a specific binding member can be prepared. Thus, an analyte is a substance that 
can bind to one or more specific binding members in an assay. "Analyte" also 
includes any antigenic substances, haptens, antibodies and combinations thereof. 
As a member of a specific binding pair, the analyte can be detected by means of 
naturally occurring specific binding partners (pairs) such as the use of intrinsic 
factor protein as a member of a specific binding pair for the determination of 
Vitamin B 12, the use of folate-binding protein to determine folic acid, or the use 
of a lectin as a member of a specific binding pair for the determination of a 
carbohydrate. The analyte can include a protein, a polypeptide, an amino acid, a 
nucleotide target and the like. The analyte can be soluble in a body fluid such as 
blood, blood plasma or serum, urine or the like. The analyte can be in a tissue, 
either on a cell surface or within a cell. The analyte can be on or in a cell dispersed 
in a body fluid such as blood, urine, breast aspirate, or obtained as a biopsy 
sample. 

The terms "diseases of the breast," "breast disease," and "condition of the 
breast" are used interchangeably herein to refer to any disease or condition of the 
breast including, but not limited to, atypical hyperplasia, fibroadenoma, cystic 
breast disease, and cancer. 

"Breast cancer," as used herein, refers to any malignant disease of the 
breast including, but not limited to, ductal carcinoma in situ, lobular carcinoma in 
situ, infiltrating ductal carcinoma, medullary carcinoma, tubular carcinoma, 
mucinous carcinoma, infiltrating lobular carcinoma, infiltrating comedocarcinoma 
and inflammatory carcinoma. 
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An "Expressed Sequence Tag" or "EST" refers to the partial sequence of a 
cDNA insert which has been made by reverse transcription of mRNA extracted 
from a tissue followed by insertion into a vector. 

A "transcript image" refers to a table or list giving the quantitative 
distribution of ESTs in a library and represents the genes active in the tissue from 
which the library was made. 

The present invention provides assays which utilize specific binding 
members. A "specific binding member," as used herein, is a member of a specific 
binding pair. That is, two different molecules where one of the molecules, 
through chemical or physical means, specifically binds to the second molecule. 
Therefore, in addition to antigen and antibody specific binding pairs of common 
immunoassays, other specific binding pairs can include biotin and avidin, 
carbohydrates and lectins, complementary nucleotide sequences, effector and 
receptor molecules, cofactors and enzymes, enzyme inhibitors, and enzymes and 
the like. Furthermore, specific binding pairs can include members that are analogs 
of the original specific binding members, for example, an analyte-analog. 
Immunoreactive specific binding members include antigens, antigen fragments, 
antibodies and antibody fragments, both monoclonal and polyclonal and 
complexes thereof, including those formed by recombinant DNA molecules. 

The term "hapten," as used herein, refers to a partial antigen or non-protein 
binding member which is capable of binding to an antibody, but which is not 
capable of eliciting antibody formation unless coupled to a carrier protein. 

A "capture reagent," as used herein, refers to an unlabeled specific binding 
member which is specific either for the analyte as in a sandwich assay, for the 
indicator reagent or analyte as in a competitive assay, or for an ancillary specific 
binding member, which itself is specific for the analyte, as in an indirect assay. 
The capture reagent can be directly or indirectly bound to a solid phase material 
before the performance of the assay or during the performance of the assay, 
thereby enabling the separation of immobilized complexes from the test sample. 

The "indicator reagent" comprises a "signal-generating compound" 
("label") which is capable of generating and generates a measurable signal 
detectable by external means, conjugated ("attached") to a specific binding 
member. In addition to being an antibody member of a specific binding pair, the 
indicator reagent also can be a member of any specific binding pair, including 
either hapten-anti-hapten systems such as biotin or anti-biotin, avidin or biotin, a 
carbohydrate or a lectin, a complementary nucleotide sequence, an effector or a 
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receptor molecule, an enzyme cofactor and an enzyme, an enzyme inhibitor or an 
enzyme and the like. An immunoreactive specific binding member can be an 
antibody, an antigen, or an antibody/antigen complex that is capable of binding 
either to the polypeptide of interest as in a sandwich assay, to the capture reagent 
5 as in a competitive assay, or to the ancillary specific binding member as in an 
indirect assay. When describing probes and probe assays, the term "reporter 
molecule" may be used. A reporter molecule comprises a signal generating 
compound as described hereinabove conjugated to a specific binding member of a 
specific binding pair, such as carbazole or adamantane. 
10 The various "signal-generating compounds" (labels) contemplated include 

chromagens, catalysts such as enzymes, luminescent compounds such as 
fluorescein and rhodamine, chemiluminescent compounds such as dioxetanes, 
acridiniums, phenanthridiniums and luminol, radioactive elements and direct 
visual labels. Examples of enzymes include alkaline phosphatase, horseradish 
1 5 peroxidase, beta-galactosidase and the like. The selection of a particular label is 
not critical, but it must be capable of producing a signal either by itself or in 
conjunction with one or more additional substances. 

"Solid phases" ("solid supports") are known to those in the art and include 
the walls of wells of a reaction tray, test tubes, polystyrene beads, magnetic or 
20 non-magnetic beads, nitrocellulose strips, membranes, microparticles such as latex 
particles, sheep (or other animal) red blood cells and Duracytes® (red blood cells 
"fixed" by pyruvic aldehyde and formaldehyde, available from Abbott 
Laboratories, Abbott Park, IL) and others. The "solid phase" is not critical and 
can be selected by one skilled in the art. Thus, latex particles, microparticles, 
25 magnetic or non-magnetic beads, membranes, plastic tubes, walls of microtiter 

wells, glass or silicon chips, sheep (or other suitable animal's) red blood cells and 
Duracytes® are all suitable examples. Suitable methods for immobilizing peptides 
on solid phases include ionic, hydrophobic, covalent interactions and the like. A 
"solid phase," as used herein, refers to any material which is insoluble, or can be 
30 made insoluble by a subsequent reaction. The solid phase can be chosen for its 
intrinsic ability to attract and immobilize the capture reagent. Alternatively, the 
solid phase can retain an additional receptor which has the ability to attract and 
immobilize the capture reagent. The additional receptor can include a charged 
substance that is oppositely charged with respect to the capture reagent itself or to 
35 a charged substance conjugated to the capture reagent. As yet another alternative, 
the receptor molecule can be any specific binding member which is immobilized 
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upon (attached to) the solid phase and which has the ability to immobilize the 
capture reagent through a specific binding reaction. The receptor molecule enables 
the indirect binding of the capture reagent to a solid phase material before the 
performance of the assay or during the performance of the assay. The solid phase 

5 thus can be a plastic, derivatized plastic, magnetic or non-magnetic metal, glass or 
silicon surface of a test tube, microtiter well, sheet, bead, microparticle, chip, 
sheep (or other suitable animal's) red blood cells, Duracytes® and other 
configurations known to those of ordinary skill in the art. 

It is contemplated and within the scope of the present invention that the 

1 0 solid phase also can comprise any suitable porous material with sufficient porosity 
to allow access by detection antibodies and a suitable surface affinity to bind 
antigens. Microporous structures generally are preferred, but materials with a gel 
structure in the hydrated state may be used as well. Such useful solid supports 
include, but are not limited to, nitrocellulose and nylon. It is contemplated that 

15 such porous solid supports described herein preferably are in the form of sheets of 
thickness from about 0.01 to 0.5 mm, preferably about 0.1 mm. The pore size 
may vary within wide limits and preferably is from about 0.025 to 15 microns, 
especially from about 0.15 to 15 microns. The surface of such supports may be 
activated by chemical processes which cause covalent linkage of the antigen or 

20 antibody to the support. The irreversible binding of the antigen or antibody is 
obtained, however, in general, by adsorption on the porous material by poorly 
understood hydrophobic forces. Other suitable solid supports are known in the 
art. 

Reagents . 

25 The present invention provides reagents such as polynucleotide sequences 

derived from a breast tissue of interest and designated as BS 124, polypeptides 
encoded thereby and antibodies specific for these polypeptides. The present 
invention also provides reagents such as oligonucleotide fragments derived from 
the disclosed polynucleotides and nucleic acid sequences complementary to these 

30 polynucleotides. The polynucleotides, polypeptides, or antibodies of the present 
invention may be used to provide information leading to the detecting, diagnosing, 
staging, monitoring, prognosticating, preventing or treating of, or determining the 
predisposition to, diseases and conditions of the breast, such as breast cancer. 
The sequences disclosed herein represent unique polynucleotides which can be 

35 used in assays or for producing a specific profile of gene transcription activity. 
Such assays are disclosed in European Patent Number 0373203B1 and 
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International Publication No. WO 95/1 1995, which are hereby incorporated by 
reference. 

Selected BS124-derived polynucleotides can be used in the methods 
described herein for the detection of normal or altered gene expression. Such 
methods may employ BS124 polynucleotides or oligonucleotides, fragments or 
derivatives thereof, or nucleic acid sequences complementary thereto. 

The polynucleotides disclosed herein, their complementary sequences, or 
fragments of either, can be used in assays to detect, amplify or quantify genes, 
nucleic acids, cDNAs or mRNAs relating to breast tissue disease and conditions 
associated therewith. They also can be used to identify an entire or partial coding 
region of a BS124 polypeptide. They further can be provided in individual 
containers in the form of a kit for assays, or provided as individual compositions. 
If provided in a kit for assays, other suitable reagents such as buffers, conjugates 
and the like may be included. 

The polynucleotide may be in the form of RNA or DNA. Polynucleotides 
in the form of DNA, cDNA, genomic DNA, nucleic acid analogs and synthetic 
DNA are within the scope of the present invention. The DNA may be double- 
stranded or single-stranded, and if single stranded, may be the coding (sense) 
strand or non-coding (anti-sense) strand. The coding sequence which encodes the 
polypeptide may be identical to the coding sequence provided herein or may be a 
different coding sequence which coding sequence, as a result of the redundancy or 
degeneracy of the genetic code, encodes the same polypeptide as the DNA 
provided herein. 

This polynucleotide may include only the coding sequence for the 
polypeptide, or the coding sequence for the polypeptide and an additional coding 
sequence such as a leader or secretory sequence or a proprotein sequence, or the 
coding sequence for the polypeptide (and optionally an additional coding 
sequence) and non-coding sequence, such as a non-coding sequence 5' and/or 3' 
of the coding sequence for the polypeptide. 

In addition, the invention includes variant polynucleotides containing 
modifications such as polynucleotide deletions, substitutions or additions; and any 
polypeptide modification resulting from the variant polynucleotide sequence. A 
polynucleotide of the present invention also may have a coding sequence which is 
a naturally occurring allelic variant of the coding sequence provided herein. 

In addition, the coding sequence for the polypeptide may be fused in the 
same reading frame to a polynucleotide sequence which aids in expression and 
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secretion of a polypeptide from a host cell, for example, a leader sequence which 
functions as a secretory sequence for controlling transport of a polypeptide from 
the cell. The polypeptide having a leader sequence is a preprotein and may have 
the leader sequence cleaved by the host cell to form the polypeptide. The 
5 polynucleotides may also encode for a proprotein which is the protein plus 

additional 5' amino acid residues. A protein having a prosequence is a proprotein 
and may, in some cases, be an inactive form of the protein. Once the prosequence 
is cleaved, an active protein remains. Thus, the polynucleotide of the present 
invention may encode for a protein, or for a protein having a prosequence, or for a 

10 protein having both a presequence (leader sequence) and a prosequence. 

The polynucleotides of the present invention may also have the coding 
sequence fused in frame to a marker sequence which allows for purification of the 
polypeptide of the present invention. The marker sequence may be a hexa- 
histidine tag supplied by a pQE-9 vector to provide for purification of the 

1 5 polypeptide fused to the marker in the case of a bacterial host, or, for example, the 
marker sequence may be a hemagglutinin (HA) tag when a mammalian host, e.g. a 
COS-7 cell line, is used. The HA tag corresponds to an epitope derived from the 
influenza hemagglutinin protein. See, for example, I. Wilson et al., Cell 37:767 
(1984). 

20 It is contemplated that polynucleotides will be considered to hybridize to 

the sequences provided herein if there is at least 50%, preferably at least 70%, and 
more preferably at least 90% identity between the polynucleotide and the 
sequence. 

The present invention also provides an antibody produced by using a 
25 purified BS 124 polypeptide of which at least a portion of the polypeptide is 

encoded by a BS124 polynucleotide selected from the polynucleotides provided 
herein. These antibodies may be used in the methods provided herein for the 
detection of BS 124 antigen in test samples. The presence of BS 124 antigen in the 
test samples is indicative of the presence of a breast disease or condition. The 
30 antibody also may be used for therapeutic purposes, for example, in neutralizing 
the activity of BS124 polypeptide in conditions associated with altered or 
abnormal expression. 

The present invention further relates to a BS124 polypeptide which has the 
deduced amino acid sequence as provided herein, as well as fragments, analogs 
35 and derivatives of such polypeptide. The polypeptide of the present invention may 
be a recombinant polypeptide, a natural purified polypeptide or a synthetic 



Atty Dkt No. 6120.US.P1 
PATENT 



-26- 

polypeptide. The fragment, derivative or analog of the BS 124 polypeptide may be 
one in which one or more of the amino acid residues is substituted with a 
conserved or non-conserved amino acid residue (preferably a conserved amino 
acid residue) and such substituted amino acid residue may or may not be one 
5 encoded by the genetic code; or it may be one in which one or more of the amino 
acid residues includes a substituent group; or it may be one in which the 
polypeptide is fused with another compound, such as a compound to increase the 
half-life of the polypeptide (for example, polyethylene glycol); or it may be one in 
which the additional amino acids are fused to the polypeptide, such as a leader or 

10 secretory sequence or a sequence which is employed for purification of the 

polypeptide or a proprotein sequence. Such fragments, derivatives and analogs 
are within the scope of the present invention. The polypeptides and 
polynucleotides of the present invention are provided preferably in an isolated 
form and preferably purified. 

1 5 Thus, a polypeptide of the present invention may have an amino acid 

sequence that is identical to that of the naturally occurring polypeptide or that is 
different by minor variations due to one or more amino acid substitutions. The 
variation may be a "conservative change" typically in the range of about 1 to 5 
amino acids, wherein the substituted amino acid has similar structural or chemical 

20 properties, e.g., replacement of leucine with isoleucine or threonine with serine. 
In contrast, variations may include nonconservative changes, e.g., replacement of 
a glycine with a tryptophan. Similar minor variations may also include amino acid 
deletions or insertions, or both. Guidance in determining which and how many 
amino acid residues may be substituted, inserted or deleted without changing 

25 biological or immunological activity may be found using computer programs well 
known in the art, for example, DNASTAR software (DNASTAR Inc., Madison 
WI). 

Probes constructed according to the polynucleotide sequences of the 
present invention can be used in various assay methods to provide various types 

30 of analysis. For example, such probes can be used in fluorescent in situ 

hybridization (FISH) technology to perform chromosomal analysis, and used to 
identify cancer-specific structural alterations in the chromosomes, such as 
deletions or translocations that are visible from chromosome spreads or detectable 
using PCR-generated and/or allele specific oligonucleotides probes, allele specific 

35 amplification or by direct sequencing. Probes also can be labeled with 

radioisotopes, directly- or indirectly- detectable haptens, or fluorescent molecules, 
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and utilized for in situ hybridization studies to evaluate the mRNA expression of 
the gene comprising the polynucleotide in tissue specimens or cells. 

This invention also provides teachings as to the production of the 
polynucleotides and polypeptides provided herein. 
5 Probe Assays 

The sequences provided herein may be used to produce probes which can 
be used in assays for the detection of nucleic acids in test samples. The probes 
may be designed from conserved nucleotide regions of the polynucleotides of 
interest or from non-conserved nucleotide regions of the polynucleotide of 
10 interest. The design of such probes for optimization in assays is within the skill of 
the routineer. Generally, nucleic acid probes are developed from non-conserved 
or unique regions when maximum specificity is desired, and nucleic acid probes 
are developed from conserved regions when assaying for nucleotide regions that 
are closely related to, for example, different members of a multi-gene family or in 
15 related species like mouse and man. 

The polymerase chain reaction (PCR) is a technique for amplifying a 
desired nucleic acid sequence (target) contained in a nucleic acid or mixture 
thereof. In PCR, a pair of primers are employed in excess to hybridize to the 
complementary strands of the target nucleic acid. The primers are each extended 
20 by a polymerase using the target nucleic acid as a template. The extension 

products become target sequences themselves, following dissociation from the 
original target strand. New primers then are hybridized and extended by a 
polymerase, and the cycle is repeated to geometrically increase the number of 
target sequence molecules. PCR is disclosed in U.S. Patents 4,683,195 and 
25 4,683,202, which are incorporated herein by reference. 

The Ligase Chain Reaction (LCR) is an alternate method for nucleic acid 
amplification. In LCR, probe pairs are used which include two primary (first and 
second) and two secondary (third and fourth) probes, all of which are employed in 
molar excess to target. The first probe hybridizes to a first segment of the target 
30 strand, and the second probe hybridizes to a second segment of the target strand, 
the first and second segments being contiguous so that the primary probes abut 
one another in 5' phosphate-3' hydroxyl relationship, and so that a ligase can 
covalently fuse or ligate the two probes into a fused product. In addition, a third 
(secondary) probe can hybridize to a portion of the first probe and a fourth 
35 (secondary) probe can hybridize to a portion of the second probe in a similar 
abutting fashion. Of course, if the target is initially double stranded, the 
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secondary probes also will hybridize to the target complement in the first instance. 
Once the ligated strand of primary probes is separated from the target strand, it 
will hybridize with the third and fourth probes which can be ligated to form a 
complementary, secondary ligated product. It is important to realize that the 
5 ligated products are functionally equivalent to either the target or its complement. 
By repeated cycles of hybridization and ligation, amplification of the target 
sequence is achieved. This technique is described more completely in EP-A- 320 
308 to K. Backman published June 16, 1989 and EP-A-439 182 to K. Backman 
et al., published July 3 1 , 1991 , both of which are incorporated herein by 
10 reference. 

For amplification of mRNAs, it is within the scope of the present invention 
to reverse transcribe mRNA into cDNA followed by polymerase chain reaction 
(RT-PCR); or, to use a single enzyme for both steps as described in U.S. Patent 
No. 5,322,770, which is incorporated herein by reference; or reverse transcribe 
1 5 mRNA into cDNA followed by asymmetric gap ligase chain reaction (RT- 

AGLCR) as described by R.L. Marshall et al.. PCR Methods and A pplications 4: 
80-84 (1994), which also is incorporated herein by reference. 

Other known amplification methods which can be utilized herein include 
but are not limited to the so-called "NASBA" or "3SR" technique described by 
20 J.C. Guatelli et al., PNAS USA 87:1874-1878 (1990) and also described by J. 

Compton, Nature 350 (No. 6313):91-92 (1991); Q-beta amplification as described 
in published European Patent Application (EPA) No. 4544610; strand 
displacement amplification (as described in G.T. Walker et al., Clin. Chem. 42:9- 
13 [1996]) and European Patent Application No. 684315; and target mediated 
25 amplification, as described in International Publication No. WO 93/22461. 

Detection of BS 124 may be accomplished using any suitable detection 
method, including those detection methods which are currently well known in the 
art, as well as detection strategies which may evolve later. Examples of the 
foregoing presently known detection methods are hereby incorporated herein by 
30 reference. See, for example, Caskey et al., U.S. Patent No. 5,582,989, Gelfand 
et al., U.S. Patent No. 5,210,015. Examples of such detection methods include 
target amplification methods as well as signal amplification technologies. An 
example of presently known detection methods would include the nucleic acid 
amplification technologies referred to as PCR, LCR, NASBA, SDA, RCR and 
35 TMA. See, for example, Caskey et al., U.S. Patent No. 5,582,989, Gelfand et 
al., U.S. Patent No. 5,210,015. All of the foregoing are hereby incorporated by 
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reference. Detection may also be accomplished using signal amplification such as 
that disclosed in Snitman et al., U.S. Patent No. 5,273,882. While the 
amplification of target or signal is preferred at present, it is contemplated and 
within the scope of the present invention that ultrasensitive detection methods 
5 which do not require amplification can be utilized herein. 

Detection, both amplified and non-amplified, may be (combined) carried 
out using a variety of heterogeneous and homogeneous detection formats. 
Examples of heterogeneous detection formats are disclosed in Snitman et al., U.S. 
Patent No. 5,273,882, Albarella et al. in EP-841 14441.9, Urdea et al., U.S. 

10 Patent No. 5,124,246, Ullman et al. U.S. Patent No. 5,185,243 and Kourilsky et 
al., U.S. Patent No. 4,581,333. All of the foregoing are hereby incorporated by 
reference. Examples of homogeneous detection formats are disclosed in, Caskey 
et al., U.S. Patent No. 5,582,989, Gelfand et al., U.S. Patent No. 5,210,015, 
which are incorporated herein by reference. Also contemplated and within the 

1 5 scope of the present invention is the use of multiple probes in the hybridization 
assay, which use improves sensitivity and amplification of the BS124 signal. 
See, for example, Caskey et al., U.S. Patent No. 5,582,989, Gelfand et al., U.S. 
Patent No. 5,210,015, which are incorporated herein by reference. 

In one embodiment, the present invention generally comprises the steps of 

20 contacting a test sample suspected of containing a target polynucleotide sequence 
with amplification reaction reagents comprising an amplification primer, and a 
detection probe that can hybridize with an internal region of the amplicon 
sequences. Probes and primers employed according to the method provided 
herein are labeled with capture and detection labels, wherein probes are labeled 

25 with one type of label and primers are labeled with another type of label. 

Additionally, the primers and probes are selected such that the probe sequence has 
a lower melt temperature than the primer sequences. The amplification reagents, 
detection reagents and test sample are placed under amplification conditions 
whereby, in the presence of target sequence, copies of the target sequence (an 

30 amplicon) are produced. In the usual case, the amplicon is double stranded 

because primers are provided to amplify a target sequence and its complementary 
strand. The double stranded amplicon then is thermally denatured to produce 
single stranded amplicon members. Upon formation of the single stranded 
amplicon members, the mixture is cooled to allow the formation of complexes 

35 between the probes and single stranded amplicon members. 
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As the single stranded amplicon sequences and probe sequences are 
cooled, the probe sequences preferentially bind the single stranded amplicon 
members. This finding is counterintuitive given that the probe sequences 
generally are selected to be shorter than the primer sequences and therefore have a 
5 lower melt temperature than the primers. Accordingly, the melt temperature of the 
amplicon produced by the primers should also have a higher melt temperature than 
the probes. Thus, as the mixture cools, the re-formation of the double stranded 
amplicon would be expected. As previously stated, however, this is not the case. 
The probes are found to preferentially bind the single stranded amplicon members. 

10 Moreover, this preference of probe/single stranded amplicon binding exists even 
when the primer sequences are added in excess of the probes. 

After the probe/single stranded amplicon member hybrids are formed, they 
are detected. Standard heterogeneous assay formats are suitable for detecting the 
hybrids using the detection labels and capture labels present on the primers and 

1 5 probes. The hybrids can be bound to a solid phase reagent by virtue of the capture 
label and detected by virtue of the detection label. In cases where the detection 
label is directly detectable, the presence of the hybrids on the solid phase can be 
detected by causing the label to produce a detectable signal, if necessary, and 
detecting the signal. In cases where the label is not directly detectable, the 

20 captured hybrids can be contacted with a conjugate, which generally comprises a 
binding member attached to a directly detectable label. The conjugate becomes 
bound to the complexes and the conjugate's presence on the complexes can be 
detected with the directly detectable label. Thus, the presence of the hybrids on 
the solid phase reagent can be determined. Those skilled in the art will recognize 

25 that wash steps may be employed to wash away unhybridized amplicon or probe 
as well as unbound conjugate. 

Although the target sequence is described as single stranded, it also is 
contemplated to include the case where the target sequence is actually double 
stranded but is merely separated from its complement prior to hybridization with 

30 the amplification primer sequences. In the case where PCR is employed in this 

method, the ends of the target sequences are usually known. In cases where LCR 
or a modification thereof is employed in the preferred method, the entire target 
sequence is usually known. Typically, the target sequence is a nucleic acid 
sequence such as, for example, RNA or DNA. 

35 The method provided herein can be used in well-known amplification 

reactions that include thermal cycle reaction mixtures, particularly in PCR and gap 
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LCR (GLCR). Amplification reactions typically employ primers to repeatedly 
generate copies of a target nucleic acid sequence, which target sequence is usually 
a small region of a much larger nucleic acid sequence. Primers are themselves 
nucleic acid sequences that are complementary to regions of a target sequence. 
5 Under amplification conditions, these primers hybridize or bind to the 

complementary regions of the target sequence. Copies of the target sequence 
typically are generated by the process of primer extension and/or ligation which 
utilizes enzymes with polymerase or ligase activity, separately or in combination, 
to add nucleotides to the hybridized primers and/or ligate adjacent probe pairs. 

10 The nucleotides that are added to the primers or probes, as monomers or 

preformed oligomers, are also complementary to the target sequence. Once the 
primers or probes have been sufficiently extended and/or ligated, they are 
separated from the target sequence, for example, by heating the reaction mixture to 
a "melt temperature" which is one in which complementary nucleic acid strands 

15 dissociate. Thus, a sequence complementary to the target sequence is formed. 

A new amplification cycle then can take place to further amplify the 
number of target sequences by separating any double stranded sequences, 
allowing primers or probes to hybridize to their respective targets, extending 
and/or ligating the hybridized primers or probes and re-separating. The 

20 complementary sequences that are generated by amplification cycles can serve as 
templates for primer extension or filling the gap of two probes to further amplify 
the number of target sequences. Typically, a reaction mixture is cycled between 
20 and 100 times, more typically, a reaction mixture is cycled between 25 and 50 
times. The numbers of cycles can be determined by the routineer. In this manner, 

25 multiple copies of the target sequence and its complementary sequence are 

produced. Thus, primers initiate amplification of the target sequence when it is 
present under amplification conditions. 

Generally, two primers which are complementary to a portion of a target 
strand and its complement are employed in PCR. For LCR, four probes, two of 

30 which are complementary to a target sequence and two of which are similarly 

complementary to the target's complement, are generally employed. In addition to 
the primer sets and enzymes previously mentioned, a nucleic acid amplification 
reaction mixture may also comprise other reagents which are well known and 
include but are not limited to: enzyme cofactors such as manganese; magnesium; 

35 salts; nicotinamide adenine dinucleotide (NAD); and deoxynucleotide 

triphosphates (dNTPs) such as, for example, deoxyadenine triphosphate, 
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deoxyguanine triphosphate, deoxycytosine triphosphate and deoxythymine 
triphosphate. 

While the amplification primers initiate amplification of the target 
sequence, the detection (or hybridization) probe is not involved in amplification. 
5 Detection probes are generally nucleic acid sequences or uncharged nucleic acid 
analogs such as, for example, peptide nucleic acids which are disclosed in 
International Publication No. WO 92/20702; morpholino analogs which are 
described in U.S. Patents Nos 5,185,444, 5,034,506 and 5,142,047; and the 
like. Depending upon the type of label carried by the probe, the probe is 

10 employed to capture or detect the amplicon generated by the amplification reaction. 
The probe is not involved in amplification of the target sequence and therefore may 
have to be rendered "non-extendible" in that additional dNTPs cannot be added to 
the probe. In and of themselves, analogs usually are non-extendible and nucleic 
acid probes can be rendered non-extendible by modifying the 3' end of the probe 

15 such that the hydroxy I group is no longer capable of participating in elongation. 
For example, the 3' end of the probe can be functionalized with the capture or 
detection label to thereby consume or otherwise block the hydroxyl group. 
Alternatively, the 3' hydroxyl group simply can be cleaved, replaced or modified. 
U.S. Patent Application Serial No. 07/049,061 filed April 19, 1993 and 

20 incorporated herein by reference describes modifications which can be used to 
render a probe non-extendible. 

The ratio of primers to probes is not important. Thus, either the probes or 
primers can be added to the reaction mixture in excess whereby the concentration 
of one would be greater than the concentration of the other. Alternatively, primers 

25 and probes can be employed in equivalent concentrations. Preferably, however, 
the primers are added to the reaction mixture in excess of the probes. Thus, 
primer to probe ratios of, for example, 5:1 and 20:1, are preferred. 

While the length of the primers and probes can vary, the probe sequences 
are selected such that they have a lower melt temperature than the primer 

30 sequences. Hence, the primer sequences are generally longer than the probe 

sequences. Typically, the primer sequences are in the range of between 20 and 50 
nucleotides long, more typically in the range of between 20 and 30 nucleotides 
long. The typical probe is in the range of between 10 and 25 nucleotides long. 

Various methods for synthesizing primers and probes are well known in 

35 the art. Similarly, methods for attaching labels to primers or probes are also well 
known in the art. For example, it is a matter of routine to synthesize desired 
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nucleic acid primers or probes using conventional nucleotide phosphoramidite 
chemistry and instruments available from Applied Biosystems, Inc., (Foster City, 
CA), DuPont (Wilmington, DE), or Milligen (Bedford MA). Many methods have 
been described for labeling oligonucleotides such as the primers or probes of the 
5 present invention. Enzo Biochemical (New York, NY) and Clontech (Palo Alto, 
CA) both have described and commercialized probe labeling techniques. For 
example, a primary amine can be attached to a 3' oligo terminus using 3'-Amine- 
ON CPG™ (Clontech, Palo Alto, CA). Similarly, a primary amine can be 
attached to a 5' oligo terminus using Aminomodifier II® (Clontech). The amines 

10 can be reacted to various haptens using conventional activation and linking 

chemistries. In addition, copending applications U.S. Serial Nos. 625,566, filed 
December 11, 1990 and 630,908, filed December 20, 1990, which are each 
incorporated herein by reference, teach methods for labeling probes at their 5' and 
3' termini, respectively. International Publication Nos WO 92/10505, published 

15 25 June 1992, and WO 92/1 1388, published 9 July 1992, teach methods for 
labeling probes at their 5' and 3' ends, respectively. According to one known 
method for labeling an oligonucleotide, a label-phosphoramidite reagent is 
prepared and used to add the label to the oligonucleotide during its synthesis. See, 
for example, N.T. Thuong et al., Tet. Letters 29(46):5905-5908 (1988); or J.S. 

20 Cohen et al., published U.S. Patent Application 07/246,688 (NTIS ORDER No. 
PAT-APPL-7-246,688) (1989). Preferably, probes are labeled at their 3' and 5' 
ends. 

A capture label is attached to the primers or probes and can be a specific 
binding member which forms a binding pair with the solid phase reagent's specific 

25 binding member. It will be understood that the primer or probe itself may serve as 
the capture label. For example, in the case where a solid phase reagent's binding 
member is a nucleic acid sequence, it may be selected such that it binds a 
complementary portion of the primer or probe to thereby immobilize the primer or 
probe to the solid phase. In cases where the probe itself serves as the binding 

30 member, those skilled in the art will recognize that the probe will contain a 
sequence or "tail" that is not complementary to the single stranded amplicon 
members. In the case where the primer itself serves as the capture label, at least a 
portion of the primer will be free to hybridize with a nucleic acid on a solid phase 
because the probe is selected such that it is not fully complementary to the primer 

35 sequence. 
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Generally, probe/single stranded amplicon member complexes can be 
detected using techniques commonly employed to perform heterogeneous 
immunoassays. Preferably, in this embodiment, detection is performed according 
to the protocols used by the commercially available Abbott LCx® instrumentation 
5 (Abbott Laboratories, Abbott Park, IL). 

The primers and probes disclosed herein are useful in typical PCR assays, 
wherein the test sample is contacted with a pair of primers, amplification is 
performed, the hybridization probe is added, and detection is performed. 

Another method provided by the present invention comprises contacting a 
10 test sample with a plurality of polynucleotides, wherein at least one polynucleotide 
is a BS124 molecule as described herein, hybridizing the test sample with the 
plurality of polynucleotides and detecting hybridization complexes. Hybridization 
complexes are identified and quantitated to compile a profile which is indicative of 
breast tissue disease, such as breast cancer. Expressed RNA sequences may 
15 further be detected by reverse transcription and amplification of the DNA product 
by procedures well-known in the art, including polymerase chain reaction (PCR). 

Drug Screening and Gene Therapy . 

The present invention also encompasses the use of gene therapy methods 
for the introduction of anti-sense BS124 derived molecules, such as 

20 polynucleotides or oligonucleotides of the present invention, into patients with 
conditions associated with abnormal expression of polynucleotides related to a 
breast tissue disease or condition especially breast cancer. These molecules, 
including antisense RNA and DNA fragments and ribozymes, are designed to 
inhibit the translation of BS 124 mRNA, and may be used therapeutically in the 

25 treatment of conditions associated with altered or abnormal expression of BS 1 24 
polynucleotide. 

Alternatively, the oligonucleotides described above can be delivered to 
cells by procedures known in the art such that the anti-sense RNA or DNA may be 
expressed in vivo to inhibit production of a BS 124 polypeptide in the manner 

30 described above. Antisense constructs to a BS 124 polynucleotide, therefore, 

reverse the action of BS 124 transcripts and may be used for treating breast tissue 
disease conditions, such as breast cancer. These antisense constructs may also be 
used to treat tumor metastases. 

The present invention also provides a method of screening a plurality of 

35 compounds for specific binding to BS124 polypeptide(s), or any fragment 

thereof, to identify at least one compound which specifically binds the BS124 
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polypeptide. Such a method comprises the steps of providing at least one 
compound; combining the BS124 polypeptide with each compound under suitable 
conditions for a time sufficient to allow binding; and detecting the BS124 
polypeptide binding to each compound. 
5 The polypeptide or peptide fragment employed in such a test may either be 

free in solution, affixed to a solid support, borne on a cell surface or located 
intracellularly. One method of screening utilizes eukaryotic or prokaryotic host 
cells which are stably transfected with recombinant nucleic acids which can 
express the polypeptide or peptide fragment. A drug, compound, or any other 

10 agent may be screened against such transfected cells in competitive binding 

assays. For example, the formation of complexes between a polypeptide and the 
agent being tested can be measured in either viable or fixed cells. 

The present invention thus provides methods of screening for drugs, 
compounds, or any other agent which can be used to treat diseases associated with 

15 BS 124. These methods comprise contacting the agent with a polypeptide or 
fragment thereof and assaying for either the presence of a complex between the 
agent and the polypeptide, or for the presence of a complex between the 
polypeptide and the cell. In competitive binding assays, the polypeptide typically 
is labeled. After suitable incubation, free (or uncomplexed) polypeptide or 

20 fragment thereof is separated from that present in bound form, and the amount of 
free or uncomplexed label is used as a measure of the ability of the particular agent 
to bind to the polypeptide or to interfere with the polypeptide/cell complex. 

The present invention also encompasses the use of competitive screening 
assays in which neutralizing antibodies capable of binding polypeptide specifically 

25 compete with a test agent for binding to the polypeptide or fragment thereof. In 
this manner, the antibodies can be used to detect the presence of any polypeptide 
in the test sample which shares one or more antigenic determinants with a BS124 
polypeptide as provided herein. 

Another technique for screening provides high throughput screening for 

30 compounds having suitable binding affinity to at least one polypeptide of BS 1 24 
disclosed herein. Briefly, large numbers of different small peptide test 
compounds are synthesized on a solid phase, such as plastic pins or some other 
surface. The peptide test compounds are reacted with polypeptide and washed. 
Polypeptide thus bound to the solid phase is detected by methods well-known in 

35 the art. Purified polypeptide can also be coated directly onto plates for use in the 
screening techniques described herein. In addition, non-neutralizing antibodies 
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can be used to capture the polypeptide and immobilize it on the solid support. 
See, for example, EP 84/03564, published on September 13, 1984, which is 
incorporated herein by reference. 

The goal of rational drug design is to produce structural analogs of 
5 biologically active polypeptides of interest or of the small molecules including 
agonists, antagonists, or inhibitors with which they interact. Such structural 
analogs can be used to design drugs which are more active or stable forms of the 
polypeptide or which enhance or interfere with the function of a polypeptide in 
vivo . J. Hodgson, Bio/Technology 9:19-21 (1991), incorporated herein by 
10 reference. 

For example, in one approach, the three-dimensional structure of a 
polypeptide, or of a polypeptide-inhibitor complex, is determined by x-ray 
crystallography, by computer modeling or, most typically, by a combination of the 
two approaches. Both the shape and charges of the polypeptide must be 

1 5 ascertained to elucidate the structure and to determine active site(s) of the 

molecule. Less often, useful information regarding the structure of a polypeptide 
may be gained by modeling based on the structure of homologous proteins. In 
both cases, relevant structural information is used to design analogous 
poly peptide-1 ike molecules or to identify efficient inhibitors 

20 Useful examples of rational drug design may include molecules which 

have improved activity or stability as shown by S. Braxton et al., Biochemistry 
31:7796-7801 (1992), or which act as inhibitors, agonists, or antagonists of 
native peptides as shown by S.B.P. Athauda et al., J Biochem. (Tokyo - ) 113 
(6):742-746 (1993), incorporated herein by reference. 

25 It also is possible to isolate a target-specific antibody selected by an assay 

as described hereinabove, and then to determine its crystal structure. In principle 
this approach yields a pharmacophore upon which subsequent drug design can be 
based. It further is possible to bypass protein crystallography altogether by 
generating anti-idiotypic antibodies ("anti-ids") to a functional, pharmacologically 

30 active antibody. As a mirror image of a mirror image, the binding site of the anti- 
id is an analog of the original receptor. The anti-id then can be used to identify 
and isolate peptides from banks of chemically or biologically produced peptides. 
The isolated peptides then can act as the pharmacophore (that is, a prototype 
pharmaceutical drug). 

35 A sufficient amount of a recombinant polypeptide of the present invention 

may be made available to perform analytical studies such as X-ray 
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crystallography. In addition, knowledge of the polypeptide amino acid sequence 
which is derivable from the nucleic acid sequence provided herein will provide 
guidance to those employing computer modeling techniques in place of, or in 
addition to, x-ray crystallography. 
5 Antibodies specific to a BS 124 polypeptide (e.g., anti-BS 124 antibodies) 

further may be used to inhibit the biological action of the polypeptide by binding to 
the polypeptide. In this manner, the antibodies may be used in therapy, for 
example, to treat breast tissue diseases including breast cancer and its metastases. 

Further, such antibodies can detect the presence or absence of a BS 124 
10 polypeptide in a test sample and, therefore, are useful as diagnostic markers for 
the diagnosis of a breast tissue disease or condition especially breast cancer. Such 
antibodies may also function as a diagnostic marker for breast tissue disease 
conditions, such as breast cancer. 

The present invention also is directed to antagonists and inhibitors of the 
15 polypeptides of the present invention. The antagonists and inhibitors are those 
which inhibit or eliminate the function of the polypeptide. Thus, for example, an 
antagonist may bind to a polypeptide of the present invention and inhibit or 
eliminate its function. The antagonist, for example, could be an antibody against 
the polypeptide which eliminates the activity of a BS124 polypeptide by binding a 
20 BS124 polypeptide, or in some cases the antagonist may be an oligonucleotide. 
Examples of small molecule inhibitors include, but are not limited to, small 
peptides or peptide-like molecules. 

The antagonists and inhibitors may be employed as a composition with a 
pharmaceutically acceptable carrier including, but not limited to, saline, buffered 
25 saline, dextrose, water, glycerol, ethanol and combinations thereof. 

Administration of BS 1 24 polypeptide inhibitors is preferably systemic. The 
present invention also provides an antibody which inhibits the action of such a 
polypeptide. 

Antisense technology can be used to reduce gene expression through 
30 triple-helix formation or antisense DNA or RNA, both of which methods are 
based on binding of a polynucleotide to DNA or RNA. For example, the 5' 
coding portion of the polynucleotide sequence, which encodes for the polypeptide 
of the present invention, is used to design an antisense RNA oligonucleotide of 
from 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be 
35 complementary to a region of the gene involved in transcription, thereby 

preventing transcription and the production of the BS124 polypeptide. For triple 
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helix, see, for example, Lee et al., Nuc. Acids Res . 6:3073 (1979); Cooney et al., 
Science 241:456 (1988); and Dervan et al., Science 251: 1360 (1991) The 
antisense RNA oligonucleotide hybridizes to the mRNA in vivo and blocks 
translation of a mRNA molecule into the BS124 polypeptide. For antisense, see, 
5 for example, Okano, J. Neurochem . 56:560 (1991); and "Oligodeoxynucleotides 
as Antisense Inhibitors of Gene Expression," CRC Press, Boca Raton, Fla. 
(1988). Antisense oligonucleotides act with greater efficacy when modified to 
contain artificial internucleotide linkages which render the molecule resistant to 
nucleolytic cleavage. Such artificial internucleotide linkages include, but are not 
10 limited to, methylphosphonate, phosphorothiolate and phosphoroamydate 
internucleotide linkages. 

Recombinant Technology . 

The present invention provides host cells and expression vectors 
comprising BS124 polynucleotides of the present invention and methods for the 

15 production of the polypeptide(s) they encode. Such methods comprise culturing 
the host cells under conditions suitable for the expression of the BS124 
polynucleotide and recovering the BS 124 polypeptide from the cell culture. 

The present invention also provides vectors which include BS124 
polynucleotides of the present invention, host cells which are genetically 

20 engineered with vectors of the present invention and the production of 
polypeptides of the present invention by recombinant techniques. 

Host cells are genetically engineered (transfected, transduced or 
transformed) with the vectors of this invention which may be cloning vectors or 
expression vectors. The vector may be in the form of a plasmid, a viral particle, a 

25 phage, etc. The engineered host cells can be cultured in conventional nutrient 

media modified as appropriate for activating promoters, selecting transfected cells, 
or amplifying BS 124 gene(s). The culture conditions, such as temperature, pH 
and the like, are those previously used with the host cell selected for expression, 
and will be apparent to the ordinarily skilled artisan. 

30 The polynucleotides of the present invention may be employed for 

producing a polypeptide by recombinant techniques. Thus, the polynucleotide 
sequence may be included in any one of a variety of expression vehicles, in 
particular, vectors or plasmids for expressing a polypeptide. Such vectors include 
chromosomal, nonchromosomal and synthetic DNA sequences, e.g., derivatives 

35 of SV40; bacterial plasmids; phage DNA; yeast plasmids; vectors derived from 
combinations of plasmids and phage DNA, viral DNA such as vaccinia, 
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adenovirus, fowl pox virus and pseudorabies. However, any other plasmid or 
vector may be used so long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety 
of procedures. In general, the DNA sequence is inserted into appropriate 
5 restriction endonuclease sites by procedures known in the art. Such procedures 
and others are deemed to be within the scope of those skilled in the art. The DNA 
sequence in the expression vector is operatively linked to an appropriate 
expression control sequence(s) (promoter) to direct mRNA synthesis. 
Representative examples of such promoters include, but are not limited to, the 

10 LTR or the SV40 promoter, the EL coH lac or trp, the phage lambda P sub L 
promoter and other promoters known to control expression of genes in 
prokaryotic or eukaryotic cells or their viruses. The expression vector also 
contains a ribosome binding site for translation initiation and a transcription 
terminator. The vector may also include appropriate sequences for amplifying 

15 expression. In addition, the expression vectors preferably contain a gene to 
provide a phenotypic trait for selection of transfected host cells such as 
dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such 
as tetracycline or ampicillin resistance in EL. coli . 

The vector containing the appropriate DNA sequence as hereinabove 

20 described, as well as an appropriate promoter or control sequence, may be 

employed to transfect an appropriate host to permit the host to express the protein. 
As representative examples of appropriate hosts, there may be mentioned: bacterial 
cells, such as E. coli . Salmonella tvphimurium : Streptomyces sp. ; fungal cells, 
such as yeast; insect cells, such as Drosophila and Sf9; animal cells, such as 

25 CHO, COS or Bowes melanoma; plant cells, etc. The selection of an appropriate 
host is deemed to be within the scope of those skilled in the art from the teachings 
provided herein. 

More particularly, the present invention also includes recombinant 
constructs comprising one or more of the sequences as broadly described above. 

30 The constructs comprise a vector, such as a plasmid or viral vector, into which a 
sequence of the invention has been inserted, in a forward or reverse orientation. 
In a preferred aspect of this embodiment, the construct further comprises 
regulatory sequences including, for example, a promoter, operably linked to the 
sequence. Large numbers of suitable vectors and promoters are known to those of 

35 skill in the art and are commercially available. The following vectors are provided 
by way of example. Bacterial: pINCY (Incyte Pharmaceuticals Inc., Palo Alto, 
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CA), pSPORTl (Life Technologies, Gaithersburg, MD), pQE70, pQE60, pQE-9 
(Qiagen) pBs, phagescript, psiX174, pBluescript SK, pBsKS, pNH8a, pNH16a, 
pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, 
pRIT5 (Pharmacia); Eukaryotic: pWLneo, pSV2cat, pOG44, pXTl, pSG 
5 (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). However, any other 
plasmid or vector may be used as long as it is replicable and viable in the host. 

Plasmid pINCY is generally identical to the plasmid pSPORTl (available 
from Life Technologies, Gaithersburg, MD) with the exception that it has two 
modifications in the polylinker (multiple cloning site). These modifications are (1) 

10 it lacks a HindEH restriction site and (2) its EcoRI restriction site lies at a different 
location. pINCY is created from pSPORTl by cleaving pSPORTl with both 
HindlH and EcoRI and replacing the excised fragment of the polylinker with 
synthetic DNA fragments (SEQUENCE ID NO 6 and SEQUENCE ID NO 7). 
This replacement may be made in any manner known to those of ordinary skill in 

15 the art. For example, the two nucleotide sequences, SEQUENCE ID NO 6 and 
SEQUENCE ID NO 7, may be generated synthetically with 5' terminal 
phosphates, mixed together, and then ligated under standard conditions for 
performing staggered end ligations into the pSPORTl plasmid cut with Hindin 
and EcoRI. Suitable host cells (such as R coli DH5ji cells) then are transfected 

20 with the ligated DNA and recombinant clones are selected for ampicillin resistance. 
Plasmid DNA then is prepared from individual clones and subjected to restriction 
enzyme analysis or DNA sequencing in order to confirm the presence of insert 
sequences in the proper orientation. Other cloning strategies known to the 
ordinary artisan also may be employed. 

25 Promoter regions can be selected from any desired gene using CAT 

(chloramphenicol transferase) vectors or other vectors with selectable markers. 
Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial 
promoters include lad, lacZ, T3, SP6, T7, gpt, lambda P sub R, P sub L and trp. 
Eukaryotic promoters include cytomegalovirus (CMV) immediate early, herpes 

30 simplex virus (HSV) thymidine kinase, early and late SV40, LTRs from 

retroviruses and mouse metallothionein-I. Selection of the appropriate vector and 
promoter is well within the level of ordinary skill in the art. 

In a further embodiment, the present invention provides host cells 
containing the above-described construct. The host cell can be a higher eukaryotic 

35 cell, such as a mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or 
the host cell can be a prokaryotic cell, such as a bacterial cell. Introduction of the 
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construct into the host cell can be effected by calcium phosphate transfection, 
DEAE-Dextran mediated transfection, or electroporation (L. Davis et al., "Basic 
Methods in Molecular Biology," 2nd edition, Appleton and Lang, Paramount 
Publishing, East Norwalk, CT (1994)). 
5 The constructs in host cells can be used in a conventional manner to 

produce the gene product encoded by the recombinant sequence. Alternatively, 
the polypeptides of the invention can be synthetically produced by conventional 
peptide synthesizers. 

Recombinant proteins can be expressed in mammalian cells, yeast, 

10 bacteria, or other cells, under the control of appropriate promoters. Cell-free 
translation systems can also be employed to produce such proteins using RNAs 
derived from the DNA constructs of the present invention. Appropriate cloning 
and expression vectors for use with prokaryotic and eukaryotic hosts are described 
by Sambrook et al., Molecular Cloning: A Laboratory Manual . Second Edition, 

15 (Cold Spring Harbor, NY, 1989), which is hereby incorporated by reference. 

Transcription of a DNA encoding the polypeptide(s) of the present 
invention by higher eukaryotes is increased by inserting an enhancer sequence into 
the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 
300 bp, that act on a promoter to increase its transcription. Examples include the 

20 SV40 enhancer on the late side of the replication origin (bp 100 to 270), a 

cytomegalovirus early promoter enhancer, a polyoma enhancer on the late side of 
the replication origin and adenovirus enhancers. 

Generally, recombinant expression vectors will include origins of 
replication and selectable markers permitting transfection of the host cell, e.g., the 

25 ampicillin resistance gene of E^ cob and SL cerevisiae TRP1 gene, and a promoter 
derived from a highly-expressed gene to direct transcription of a downstream 
structural sequence. Such promoters can be derived from operons encoding 
glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), alpha factor, acid 
phosphatase, or heat shock proteins, among others. The heterologous structural 

30 sequence is assembled in appropriate phase with translation initiation and 

termination sequences, and preferably, a leader sequence capable of directing 
secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an N- 
terminal identification peptide imparting desired characteristics, e.g., stabilization 

35 or simplified purification of expressed recombinant product. 
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Useful expression vectors for bacterial use are constructed by inserting a 
structural DNA sequence encoding a desired protein together with suitable 
translation initiation and termination signals in operable reading phase with a 
functional promoter. The vector will comprise one or more phenotypic selectable 
5 markers and an origin of replication to ensure maintenance of the vector and to, if 
desirable, provide amplification within the host. Suitable prokaryotic hosts for 
transfection include E, coH, Bacillus subtilis . Salmonella typhimurium and various 
species within the genera Pseudomonas . Streptomyces and Staphylococcus , 
although others may also be employed as a routine matter of choice. 

10 Useful expression vectors for bacterial use comprise a selectable marker 

and bacterial origin of replication derived from plasmids comprising genetic 
elements of the well-known cloning vector pBR322 (ATCC 37017). Other 
vectors include but are not limited to PKK223-3 (Pharmacia Fine Chemicals, 
Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, WI). These pBR322 

1 5 "backbone" sections are combined with an appropriate promoter and the structural 
sequence to be expressed. 

Following transfection of a suitable host and growth of the host to an 
appropriate cell density, the selected promoter is derepressed by appropriate means 
(e.g., temperature shift or chemical induction), and cells are cultured for an 

20 additional period. Cells are typically harvested by centrifugation, disrupted by 
physical or chemical means, and the resulting crude extract retained for further 
purification. Microbial cells employed in expression of proteins can be disrupted 
by any convenient method including freeze-thaw cycling, sonication, mechanical 
disruption, or use of cell lysing agents. Such methods are well-known to the 

25 ordinary artisan. 

Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the 
COS-7 lines of monkey kidney fibroblasts described by Gluzman, Cell 23:175 
(1981), and other cell lines capable of expressing a compatible vector, such as the 

30 CI 27, HEK-293, 3T3, CHO, HeLa and BHK cell lines. Mammalian expression 
vectors will comprise an origin of replication, a suitable promoter and enhancer 
and also any necessary ribosome binding sites, polyadenylation sites, splice donor 
and acceptor sites, transcriptional termination sequences and 5' flanking 
nontranscribed sequences. DNA sequences derived from the SV40 viral genome, 

35 for example, SV40 origin, early promoter, enhancer, splice, and polyadenylation 
sites may be used to provide the required nontranscribed genetic elements. 
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Representative, useful vectors include pRc/CMV and pcDNA3 (available from 
Invitrogen, San Diego, CA). 

BS124 polypeptides are recovered and purified from recombinant cell 
cultures by known methods including affinity chromatography, ammonium sulfate 
5 or ethanol precipitation, acid extraction, anion or cation exchange 

chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, hydroxyapatite chromatography or lectin chromatography. It is 
preferred to have low concentrations (approximately 0.1-5 mM) of calcium ion 
present during purification (Price, et al., J. Biol. Chem . 244:917 (1969)). Protein 
10 refolding steps can be used, as necessary, in completing configuration of the 
polypeptide. Finally, high performance liquid chromatography (HPLC) can be 
employed for final purification steps. 

Thus, polypeptides of the present invention may be naturally purified 
products expressed from a high expressing cell line, or a product of chemical 
1 5 synthetic procedures, or produced by recombinant techniques from a prokaryotic 
or eukaryotic host (for example, by bacterial, yeast, higher plant, insect and 
mammalian cells in culture). Depending upon the host employed in a recombinant 
production procedure, the polypeptides of the present invention may be 
glycosylated with mammalian or other eukaryotic carbohydrates or may be non- 
20 glycosylated. The polypeptides of the invention may also include an initial 
methionine amino acid residue. 

The starting plasmids can be constructed from available plasmids in accord 
with published, known procedures. In addition, equivalent plasmids to those 
described are known in the art and will be apparent to one of ordinary skill in the 
25 art. 

The following is the general procedure for the isolation and analysis of 
cDNA clones. In a particular embodiment disclosed herein, mRNA was isolated 
from breast tissue and used to generate the cDNA library. Breast tissue was 
obtained from patients by surgical resection and was classified as tumor or non- 
30 tumor tissue by a pathologist. 

The cDNA inserts from random isolates of the breast tissue libraries were 
sequenced in part, analyzed in detail as set forth in the Examples, and are 
disclosed in the Sequence Listing as SEQUENCE ID NO 1, SEQUENCE ID NO 
2, and SEQUENCE ID NO 3. Also analyzed in detail as set forth in the 
35 Examples, and disclosed in the Sequence Listing, is the full-length sequence of 
clone 1730294 [designated as clone 1730294IH (SEQUENCE ID NO 4)]. The 
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consensus sequence of these inserts is presented as SEQUENCE ID NO 5. These 
polynucleotides may contain an entire open reading frame with or without 
associated regulatory sequences for a particular gene, or they may encode only a 
portion of the gene of interest. This is attributed to the fact that many genes are 
5 several hundred and sometimes several thousand bases in length and, with current 
technology, cannot be cloned in their entirety because of vector limitations, 
incomplete reverse transcription of the first strand, or incomplete replication of the 
second strand. Contiguous, secondary clones containing additional nucleotide 
sequences may be obtained using a variety of methods known to those of skill in 
10 the art. 

Methods for DNA sequencing are well known in the art. Conventional 
enzymatic methods employ DNA polymerase, Klenow fragment, Sequenase (US 
Biochemical Corp, Cleveland, OH) or Taq polymerase to extend DNA chains 
from an oligonucleotide primer annealed to the DNA template of interest. Methods 

1 5 have been developed for the use of both single-stranded and double-stranded 
templates. The chain termination reaction products may be electrophoresed on 
urea/polyacrylamide gels and detected either by autoradiography (for 
radionucleotide labeled precursors) or by fluorescence (for fluorescent-labeled 
precursors). Recent improvements in mechanized reaction preparation, 

20 sequencing and analysis using the fluorescent detection method have permitted 
expansion in the number of sequences that can be determined per day using 
machines such as the Applied Biosystems 377 DNA Sequencers (Applied 
Biosystems, Foster City, CA). 

The reading frame of the nucleotide sequence can be ascertained by several 

25 types of analyses. First, reading frames contained within the coding sequence can 
be analyzed for the presence of start codon ATG and stop codons TGA, TAA or 
TAG. Typically, one reading frame will continue throughout the major portion of 
a cDNA sequence while other reading frames tend to contain numerous stop 
codons. In such cases, reading frame determination is straightforward. In other 

30 more difficult cases, further analysis is required. 

Algorithms have been created to analyze the occurrence of individual 
nucleotide bases at each putative codon triplet. See, for example J.W. Fickett, 
Nuc. Acids Res. 10:5303 (1982). Coding DNA for particular organisms 
(bacteria, plants and animals) tends to contain certain nucleotides within certain 

35 triplet periodicities, such as a significant preference for pyrimidines in the third 
codon position. These preferences have been incorporated into widely available 
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software which can be used to determine coding potential (and frame) of a given 
stretch of DNA. The algorithm-derived information combined with start/stop 
codon information can be used to determine proper frame with a high degree of 
certainty. This, in turn, readily permits cloning of the sequence in the correct 
5 reading frame into appropriate expression vectors. 

The nucleic acid sequences disclosed herein may be joined to a variety of 
other polynucleotide sequences and vectors of interest by means of well- 
established recombinant DNA techniques. See J. Sambrook et al., supra . Vectors 
of interest include cloning vectors, such as plasmids, cosmids, phage derivatives, 

10 phagemids, as well as sequencing, replication and expression vectors, and the 
like. In general, such vectors contain an origin of replication functional in at least 
one organism, convenient restriction endonuclease digestion sites and selectable 
markers appropriate for particular host cells. The vectors can be transferred by a 
variety of means known to those of skill in the art into suitable host cells which 

15 then produce the desired DNA, RNA or polypeptides. 

Occasionally, sequencing or random reverse transcription errors will mask 
the presence of the appropriate open reading frame or regulatory element. In such 
cases, it is possible to determine the correct reading frame by attempting to express 
the polypeptide and determining the amino acid sequence by standard peptide 

20 mapping and sequencing techniques. See, F.M. Ausubel et al., Current Protocols 
in Molecular Biology . John Wiley & Sons, New York, NY (1989). Additionally, 
the actual reading frame of a given nucleotide sequence may be determined by 
transfection of host cells with vectors containing all three potential reading frames. 
Only those cells with the nucleotide sequence in the correct reading frame will 

25 produce a peptide of the predicted length. 

The nucleotide sequences provided herein have been prepared by current, 
state-of-the-art, automated methods and, as such, may contain unidentified 
nucleotides. These will not present a problem to those skilled in the art who wish 
to practice the invention. Several methods employing standard recombinant 

30 techniques, described in J. Sambrook ( supra ) or periodic updates thereof, may be 
used to complete the missing sequence information. The same techniques used for 
obtaining a full length sequence, as described herein, may be used to obtain 
nucleotide sequences. 

Expression of a particular cDNA may be accomplished by subcloning the 

35 cDNA into an appropriate expression vector and transfecting this vector into an 
appropriate expression host. The cloning vector used for the generation of the 
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breast tissue cDNA library can be used for transcribing mRNA of a particular 
cDNA and contains a promoter for beta-galactosidase, an amino-terminal met and 
the subsequent seven amino acid residues of beta-galactosidase. Immediately 
following these eight residues is an engineered bacteriophage promoter useful for 
5 artificial priming and transcription, as well as a number of unique restriction sites, 
including EcoRI, for cloning. The vector can be transfected into an appropriate 
host strain of coli . 

Induction of the isolated bacterial strain with isopropylthiogalactoside 
(IPTG) using standard methods will produce a fusion protein which contains the 

10 first seven residues of beta-galactosidase, about 15 residues of linker and the 

peptide encoded within the cDNA. Since cDNA clone inserts are generated by an 
essentially random process, there is one chance in three that the included cDNA 
will lie in the correct frame for proper translation. If the cDNA is not in the proper 
reading frame, the correct frame can be obtained by deletion or insertion of an 

15 appropriate number of bases by well known methods including in vitro 
mutagenesis, digestion with exonuclease III or mung bean nuclease, or 
oligonucleotide linker inclusion. 

The cDNA can be shuttled into other vectors known to be useful for 
expression of protein in specific hosts. Oligonucleotide primers containing 

20 cloning sites and segments of DNA sufficient to hybridize to stretches at both ends 
of the target cDNA can be synthesized chemically by standard methods. These 
primers can then be used to amplify the desired gene segments by PCR. The 
resulting new gene segments can be digested with appropriate restriction enzymes 
under standard conditions and isolated by gel electrophoresis. Alternately, similar 

25 gene segments can be produced by digestion of the cDNA with appropriate 
restriction enzymes and filling in the missing gene segments with chemically 
synthesized oligonucleotides. Segments of the coding sequence from more than 
one gene can be ligated together and cloned in appropriate vectors to optimize 
expression of recombinant sequence. 

30 Suitable expression hosts for such chimeric molecules include, but are not 

limited to, mammalian cells, such as Chinese Hamster Ovary (CHO) and human 
embryonic kidney (HEK) 293 cells, insect cells, such as Sf9 cells, yeast cells, 
such as Saccharomyces cerevisiae and bacteria, such as E, coli . For each of these 
cell systems, a useful expression vector may also include an origin of replication 

35 to allow propagation in bacteria and a selectable marker such as the beta-lactamase 
antibiotic resistance gene to allow selection in bacteria. In addition, the vectors 
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may include a second selectable marker, such as the neomycin phosphotransferase 
gene, to allow selection in transfected eukaryotic host cells. Vectors for use in 
eukaryotic expression hosts may require the addition of 3' poly A tail if the 
sequence of interest lacks poly A. 
5 Additionally, the vector may contain promoters or enhancers which 

increase gene expression. Such promoters are host specific and include, but are 
not limited to, MMTV, SV40, or metallothionine promoters for CHO cells; trp, 
lac, tac or T7 promoters for bacterial hosts; or alpha factor, alcohol oxidase or 
PGH promoters for yeast. Adenoviral vectors with or without transcription 

10 enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used to drive 
protein expression in mammalian cell lines. Once homogeneous cultures of 
recombinant cells are obtained, large quantities of recombinantly produced protein 
can be recovered from the conditioned medium and analyzed using 
chromatographic methods well known in the art. An alternative method for the 

1 5 production of large amounts of secreted protein involves the transfection of 
mammalian embryos and the recovery of the recombinant protein from milk 
produced by transgenic cows, goats, sheep, etc. Polypeptides and closely related 
molecules may be expressed recombinantly in such a way as to facilitate protein 
purification. One approach involves expression of a chimeric protein which 

20 includes one or more additional polypeptide domains not naturally present on 
human polypeptides. Such purification-facilitating domains include, but are not 
limited to, metal-chelating peptides such as histidine-tryptophan domains that 
allow purification on immobilized metals, protein A domains that allow 
purification on immobilized immunoglobulin and the domain utilized in the 

25 FLAGS extension/affinity purification system (Immunex Corp, Seattle, WA). 
The inclusion of a cleavable linker sequence such as Factor XA or enterokinase 
from Invitrogen (San Diego, CA) between the polypeptide sequence and the 
purification domain may be useful for recovering the polypeptide. 
Immunoassays. 

30 BS124 polypeptides, including fragments, derivatives, and analogs 

thereof, or cells expressing such polypeptides, can be utilized in a variety of 
assays, many of which are described herein, for the detection of antibodies to 
breast tissue. They also can be used as immunogens to produce antibodies. 
These antibodies can be, for example, polyclonal or monoclonal antibodies, 

35 chimeric, single chain and humanized antibodies, as well as Fab fragments, or the 
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product of an Fab expression library. Various procedures known in the art may 
be used for the production of such antibodies and fragments. 

For example, antibodies generated against a polypeptide comprising a 
sequence of the present invention can be obtained by direct injection of the 
5 polypeptide into an animal or by administering the polypeptide to an animal such 
as a mouse, rabbit, goat or human. A mouse, rabbit or goat is preferred. The 
polypeptide is selected from the group consisting of SEQUENCE ID NO 22, 
SEQUENCE ID NO 23, SEQUENCE ID NO 24, SEQUENCE ID NO 25, and 
fragments thereof. The antibody so obtained then will bind the polypeptide itself. 

10 In this manner, even a sequence encoding only a fragment of the polypeptide can 
be used to generate antibodies that bind the native polypeptide. Such antibodies 
then can be used to isolate the polypeptide from test samples such as tissue 
suspected of containing that polypeptide. For preparation of monoclonal 
antibodies, any technique which provides antibodies produced by continuous cell 

15 line cultures can be used. Examples include the hybridoma technique as described 
by Kohler and Milstein, Nature 256:495-497 (1975), the trioma technique, the 
human B-cell hybridoma technique as described by Kozbor et al., Immun. Today 
4:72 (1983) and the EBV-hybridoma technique to produce human monoclonal 
antibodies as described by Cole et al., in Monoclonal Antibodies and Cancer 

20 Therapy . Alan R. Liss, Inc, New York, NY, pp. 77-96 (1985). Techniques 

described for the production of single chain antibodies can be adapted to produce 
single chain antibodies to immunogenic polypeptide products of this invention. 
See, for example, U.S. Patent No. 4,946,778, which is incorporated herein by 
reference. 

25 Various assay formats may utilize the antibodies of the present invention, 

including "sandwich" immunoassays and probe assays. For example, the 
antibodies of the present invention, or fragments thereof, can be employed in 
various assay systems to determine the presence, if any, of BS124 antigen in a test 
sample. For example, in a first assay format, a polyclonal or monoclonal antibody 

30 or fragment thereof, or a combination of these antibodies, which has been coated 
on a solid phase, is contacted with a test sample, to form a first mixture. This first 
mixture is incubated for a time and under conditions sufficient to form 
antigen/antibody complexes. Then, an indicator reagent comprising a monoclonal 
or a polyclonal antibody or a fragment thereof, or a combination of these 

35 antibodies, to which a signal generating compound has been attached, is contacted 
with the antigen/antibody complexes to form a second mixture. This second 
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mixture then is incubated for a time and under conditions sufficient to form 
antibody/antigen/antibody complexes. The presence of BS124 antigen in the test 
sample and captured on the solid phase, if any, is determined by detecting the 
measurable signal generated by the signal generating compound. The amount of 
5 BS 124 antigen present in the test sample is proportional to the signal generated. 

In an alternative assay format, a mixture is formed by contacting: (1) a 
polyclonal antibody, monoclonal antibody, or fragment thereof, which specifically 
binds to BS124 antigen, or a combination of such antibodies bound to a solid 
support; (2) the test sample; and (3) an indicator reagent comprising a monoclonal 

10 antibody, polyclonal antibody, or fragment thereof, which specifically binds to a 
different BS124 antigen (or a combination of these antibodies) to which a signal 
generating compound is attached. This mixture is incubated for a time and under 
conditions sufficient to form antibody/antigen/antibody complexes. The presence, 
if any, of BS124 antigen present in the test sample and captured on the solid phase 

1 5 is determined by detecting the measurable signal generated by the signal generating 
compound. The amount of BS 124 antigen present in the test sample is 
proportional to the signal generated. 

In another assay format, one or a combination of at least two monoclonal 
antibodies of the invention can be employed as a competitive probe for the 

20 detection of antibodies to BS 1 24 antigen. For example, BS 1 24 polypeptides such 
as the recombinant antigens disclosed herein, either alone or in combination, are 
coated on a solid phase. A test sample suspected of containing antibody to BS 124 
antigen then is incubated with an indicator reagent comprising a signal generating 
compound and at least one monoclonal antibody of the invention for a time and 

25 under conditions sufficient to form antigen/antibody complexes of either the test 
sample and indicator reagent bound to the solid phase or the indicator reagent 
bound to the solid phase. The reduction in binding of the monoclonal antibody to 
the solid phase can be quantitatively measured. 

In yet another detection method, each of the monoclonal or polyclonal 

30 antibodies of the present invention can be employed in the detection of BS 124 
antigens in tissue sections, as well as in cells, by immunohistochemical analysis. 
The tissue sections can be cut from either frozen or chemically fixed samples of 
tissue. If the antigens are to be detected in cells, the cells can be isolated from 
blood, urine, breast aspirates, or other bodily fluids. The cells may be obtained 

35 by biopsy, either surgical or by needle. The cells can be isolated by centrifugation 
or magnetic attraction after labeling with magnetic particles or ferrofluids so as to 
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enrich a particular fraction of cells for staining with the antibodies of the present 
invention. Cytochemical analysis wherein these antibodies are labeled directly 
(with, for example, fluorescein, colloidal gold, horseradish peroxidase, alkaline 
phosphatase, etc.) or are labeled by using secondary labeled anti-species 
5 antibodies (with various labels as exemplified herein) to track the histopathology 
of disease also are within the scope of the present invention. 

In addition, these monoclonal antibodies can be bound to matrices similar 
to CNBr-activated Sepharose and used for the affinity purification of specific 
BS 124 polypeptides from cell cultures or biological tissues such as to purify 

10 recombinant and native BS 124 proteins. 

The monoclonal antibodies of the invention also can be used for the 
generation of chimeric antibodies for therapeutic use, or other similar applications. 

The monoclonal antibodies or fragments thereof can be provided 
individually to detect BS 124 antigens. Combinations of the monoclonal 

15 antibodies (and fragments thereof) provided herein also may be used together as 
components in a mixture or "cocktail" of at least one BS124 antibody of the 
invention, along with antibodies which specifically bind to other BS124 regions, 
each antibody having different binding specificities. Thus, this cocktail can 
include the monoclonal antibodies of the invention which are directed to BS124 

20 polypeptides disclosed herein and other monoclonal antibodies specific to other 
antigenic determinants of BS 1 24 antigens or other related proteins. 

The polyclonal antibody or fragment thereof which can be used in the 
assay formats should specifically bind to a BS124 polypeptide or other BS124 
polypeptides additionally used in the assay. The polyclonal antibody used 

25 preferably is of mammalian origin such as, human, goat, rabbit or sheep 
polyclonal antibody which binds BS124 polypeptide. Most preferably, the 
polyclonal antibody is of rabbit origin. The polyclonal antibodies used in the 
assays can be used either alone or as a cocktail of polyclonal antibodies. Since the 
cocktails used in the assay formats are comprised of either monoclonal antibodies 

30 or polyclonal antibodies having different binding specificity to BS 124 

polypeptides, they are useful for the detecting, diagnosing, staging, monitoring, 
prognosticating, preventing or treating, or determining the predisposition to, 
diseases and conditions of the breast, such as breast cancer. 

It is contemplated and within the scope of the present invention that BS 124 

35 antigen may be detectable in assays by use of a recombinant antigen as well as by 
use of a synthetic peptide or purified peptide, which peptide comprises an amino 
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acid sequence of BS 124. The amino acid sequence of such a polypeptide is 
selected from the group consisting of SEQUENCE ID NO 22, SEQUENCE ID 
NO 23, SEQUENCE ID NO 24, SEQUENCE ID NO 25, and fragments thereof. 
It also is within the scope of the present invention that different synthetic, 
5 recombinant or purified peptides, identifying different epitopes of BS 124, can be 
used in combination in an assay for the detecting, diagnosing, staging, 
monitoring, prognosticating, preventing or treating, or determining the 
predisposition to diseases and conditions of the breast, such as breast cancer. In 
this case, all of these peptides can be coated onto one solid phase; or each separate 

10 peptide may be coated onto separate solid phases, such as microparticles, and then 
combined to form a mixture of peptides which can be later used in assays. 
Furthermore, it is contemplated that multiple peptides which define epitopes from 
different antigens may be used for the detection, diagnosis, staging, monitoring, 
prognosis, prevention or treatment of, or determining the predisposition to, 

15 diseases and conditions of the breast, such as breast cancer. Peptides coated on 
solid phases or labeled with detectable labels are then allowed to compete with 
those present in a patient sample (if any) for a limited amount of antibody. A 
reduction in binding of the synthetic, recombinant, or purified peptides to the 
antibody (or antibodies) is an indication of the presence of BS 1 24 antigen in the 

20 patient sample. The presence of BS 1 24 antigen indicates the presence of breast 
tissue disease, especially breast cancer, in the patient. Variations of assay formats 
are known to those of ordinary skill in the art and many are discussed herein 
below. 

In another assay format, the presence of anti-BS124 antibody and/or 
25 BS124 antigen can be detected in a simultaneous assay, as follows. A test sample 
is simultaneously contacted with a capture reagent of a first analyte, wherein said 
capture reagent comprises a first binding member specific for a first analyte 
attached to a solid phase and a capture reagent for a second analyte, wherein said 
capture reagent comprises a first binding member for a second analyte attached to a 
30 second solid phase, to thereby form a mixture. This mixture is incubated for a 
time and under conditions sufficient to form capture reagent/first analyte and 
capture reagent/second analyte complexes. These so-formed complexes then are 
contacted with an indicator reagent comprising a member of a binding pair specific 
for the first analyte labeled with a signal generating compound and an indicator 
35 reagent comprising a member of a binding pair specific for the second analyte 

labeled with a signal generating compound to form a second mixture. This second 
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mixture is incubated for a time and under conditions sufficient to form capture 
reagent/first analyte/indicator reagent complexes and capture reagent/second 
analyte/indicator reagent complexes. The presence of one or more analytes is 
determined by detecting a signal generated in connection with the complexes 
5 formed on either or both solid phases as an indication of the presence of one or 
more analytes in the test sample. In this assay format, recombinant antigens 
derived from the expression systems disclosed herein may be utilized, as well as 
monoclonal antibodies produced from the proteins derived from the expression 
systems as disclosed herein. For example, in this assay system, BS124 antigen 
10 can be the first analyte. Such assay systems are described in greater detail in EP 
Publication No. 0473065. 

In yet other assay formats, the polypeptides disclosed herein may be 
utilized to detect the presence of antibody against BS124 antigen in test samples. 
For example, a test sample is incubated with a solid phase to which at least one 

15 polypeptide such as a recombinant protein or synthetic peptide has been attached. 
The polypeptide is selected from the group consisting of SEQUENCE ED NO 22, 
SEQUENCE ID NO 23, SEQUENCE ID NO 24, SEQUENCE ID NO 25, and 
fragments thereof. These are reacted for a time and under conditions sufficient to 
form antigen/antibody complexes. Following incubation, the antigen/antibody 

20 complex is detected. Indicator reagents may be used to facilitate detection, 

depending upon the assay system chosen. In another assay format, a test sample 
is contacted with a solid phase to which a recombinant protein produced as 
described herein is attached, and also is contacted with a monoclonal or polyclonal 
antibody specific for the protein, which preferably has been labeled with an 

25 indicator reagent. After incubation for a time and under conditions sufficient for 
antibody/antigen complexes to form, the solid phase is separated from the free 
phase, and the label is detected in either the solid or free phase as an indication of 
the presence of antibody against BS 124 antigen. Other assay formats utilizing the 
recombinant antigens disclosed herein are contemplated. These include contacting 

30 a test sample with a solid phase to which at least one antigen from a first source 
has been attached, incubating the solid phase and test sample for a time and under 
conditions sufficient to form antigen/antibody complexes, and then contacting the 
solid phase with a labeled antigen, which antigen is derived from a second source 
different from the first source. For example, a recombinant protein derived from a 

35 first source such as R coH is used as a capture antigen on a solid phase, a test 

sample is added to the so-prepared solid phase, and following standard incubation 
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and washing steps as deemed or required, a recombinant protein derived from a 
different source (i.e., non-E, coH) is utilized as a part of an indicator reagent 
which subsequently is detected. Likewise, combinations of a recombinant antigen 
on a solid phase and synthetic peptide in the indicator phase also are possible. 
Any assay format which utilizes an antigen specific for BS124 produced or 
derived from a first source as the capture antigen and an antigen specific for 
BS124 from a different second source is contemplated. Thus, various 
combinations of recombinant antigens, as well as the use of synthetic peptides, 
purified proteins and the like, are within the scope of this invention. Assays such 
as this and others are described in U.S. Patent No. 5,254,458, which enjoys 
common ownership and is incorporated herein by reference. 

Other embodiments which utilize various other solid phases also are 
contemplated and are within the scope of this invention. For example, ion capture 
procedures for immobilizing an immobilizable reaction complex with a negatively 
charged polymer (described in EP publication 0326100 and EP publication No. 
0406473), can be employed according to the present invention to effect a fast 
solution-phase immunochemical reaction. An immobilizable immune complex is 
separated from the rest of the reaction mixture by ionic interactions between the 
negatively charged poly-anion/immune complex and the previously treated, 
positively charged porous matrix and detected by using various signal generating 
systems previously described, including those described in chemiluminescent 
signal measurements as described in EPO Publication No. 0 273,1 15. 

Also, the methods of the present invention can be adapted for use in 
systems which utilize microparticle technology including automated and semi- 
automated systems wherein the solid phase comprises a microparticle (magnetic or 
non-magnetic). Such systems include those described in, for example, published 
EPO applications Nos. EP 0 425 633 and EP 0 424 634, respectively. 

The use of scanning probe microscopy (SPM) for immunoassays also is a 
technology to which the monoclonal antibodies of the present invention are easily 
adaptable. In scanning probe microscopy, particularly in atomic force 
microscopy, the capture phase, for example, at least one of the monoclonal 
antibodies of the invention, is adhered to a solid phase and a scanning probe 
microscope is utilized to detect antigen/antibody complexes which may be present 
on the surface of the solid phase. The use of scanning tunneling microscopy 
eliminates the need for labels which normally must be utilized in many 
immunoassay systems to detect antigen/antibody complexes. The use of SPM to 
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monitor specific binding reactions can occur in many ways. In one embodiment, 
one member of a specific binding partner (analyte specific substance which is the 
monoclonal antibody of the invention) is attached to a surface suitable for 
scanning. The attachment of the analyte specific substance may be by adsorption 
to a test piece which comprises a solid phase of a plastic or metal surface, 
following methods known to those of ordinary skill in the art. Or, covalent 
attachment of a specific binding partner (analyte specific substance) to a test piece 
which test piece comprises a solid phase of derivatized plastic, metal, silicon, or 
glass may be utilized. Covalent attachment methods are known to those skilled in 
the art and include a variety of means to irreversibly link specific binding partners 
to the test piece. If the test piece is silicon or glass, the surface must be activated 
prior to attaching the specific binding partner. Also, polyelectrolyte interactions 
may be used to immobilize a specific binding partner on a surface of a test piece by 
using techniques and chemistries. The preferred method of attachment is by 
covalent means. Following attachment of a specific binding member, the surface 
may be further treated with materials such as serum, proteins, or other blocking 
agents to minimize non-specific binding. The surface also may be scanned either 
at the site of manufacture or point of use to verify its suitability for assay 
purposes. The scanning process is not anticipated to alter the specific binding 
properties of the test piece. 

While the present invention discloses the preference for the use of solid 
phases, it is contemplated that the reagents such as antibodies, proteins and 
peptides of the present invention can be utilized in non-solid phase assay systems. 
These assay systems are known to those skilled in the art, and are considered to be 
within the scope of the present invention. 

It is contemplated that the reagent employed for the assay can be provided 
in the form of a test kit with one or more containers such as vials or bottles, with 
each container containing a separate reagent such as a probe, primer, monoclonal 
antibody or a cocktail of monoclonal antibodies, or a polypeptide (e.g. 
recombinantly, synthetically produced or purified) employed in the assay. The 
polypeptide is selected from the group consisting of SEQUENCE ID NO 22, 
SEQUENCE ID NO 23, SEQUENCE ID NO 24, SEQUENCE ID NO 25, and 
fragments thereof. Other components such as buffers, controls and the like, 
known to those of ordinary skill in art, may be included in such test kits. It also is 
contemplated to provide test kits which have means for collecting test samples 
comprising accessible body fluids, e.g., blood, urine, saliva and stool. Such 
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tools useful for collection ("collection materials") include lancets and absorbent 
paper or cloth for collecting and stabilizing blood; swabs for collecting and 
stabilizing saliva; cups for collecting and stabilizing urine or stool samples. 
Collection materials, papers, cloths, swabs, cups and the like, may optionally be 
5 treated to avoid denaturation or irreversible adsorption of the sample. The 

collection materials also may be treated with or contain preservatives, stabilizers or 
antimicrobial agents to help maintain the integrity of the specimens. Test kits 
designed for the collection, stabilization and preservation of test specimens 
obtained by surgery or needle biopsy are also useful. It is contemplated that all 
10 kits may be configured in two components which can be provided separately; one 
component for collection and transport of the specimen and the other component 
for the analysis of the specimen. The collection component, for example, can be 
provided to the open market user while the components for analysis can be 
provided to others such as laboratory personnel for determination of the presence, 
15 absence or amount of analyte. Further, kits for the collection, stabilization and 
preservation of test specimens may be configured for use by untrained personnel 
and may be available in the open market for use at home with subsequent 
transportation to a laboratory for analysis of the test sample. 
Tn Vivo Antibody Use. 
20 Antibodies of the present invention can be used in vivo; that is, they can be 

injected into patients suspected of having or having diseases of the breast for 
diagnostic or therapeutic uses. The use of antibodies for in vivo diagnosis is well 
known in the art. Sumerdon et al, Nucl. Med. Biol, 17, 247-254 (1990) have 
described an optimized antibody-chelator for the radioimmunoscintographic 
25 imaging of carcinoembryonic antigen (CEA) expressing tumors using Indium- 1 1 1 
as the label. Griffin et al, J Clin One, 9, 631-640 (1991) have described the use 
of this agent in detecting tumors in patients suspected of having recurrent 
colorectal cancer. The use of similar agents with paramagnetic ions as labels for 
magnetic resonance imaging is know in the art (R. B. Lauffer, Magnetic 
30 Resonance in Medicine, 22, 339-342 (1991). It is anticipated that antibodies 

directed against BS124 antigen can be injected into patients suspected of having a 
disease of the breast such as breast cancer for the purpose of diagnosing or staging 
the disease status of the patient. The label used will depend on the imaging 
modality chosen. Radioactive labels such as Indium- 1 1 1, Technetium-99m, or 
35 Iodine- 1 3 1 can be used for planar scans or single photon emission computed 

tomography (SPECT). Positron emitting labels such as Fluorine- 19 can also be 
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used for positron emission tomography (PET). For MRI, paramagnetic ions such 
as Gadolinium (HI) or Manganese (II) can be used. Localization of the label 
within the breast or external to the breast may allow determination of spread of the 
disease. The amount of label within the breast may allow determination of the 
presence or absence of cancer of the breast. 

For patients known to have a disease of the breast, injection of an antibody 
directed against BS 124 antigen may have therapeutic benefit. The antibody may 
exert its effect without the use of attached agents by binding to BS124 antigen 
expressed on or in the tissue or organ. Alternatively, the antibody may be 
conjugated to cytotoxic agents such as drugs, toxins, or radionuclides to enhance 
its therapeutic effect. Garnett and Baldwin, Cancer Research, 46, 2407-2412 
(1986) have described the preparation of a drug-monoclonal antibody conjugate. 
Pastan et al, Cell, 47, 641-648 (1986) have reviewed the use of toxins conjugated 
to monoclonal antibodies for the therapy of various cancers. Goodwin and 
Meares, Cancer Supplement, 80, 2675-2680 (1997) have described the use of 
Yttrium-90 labeled monoclonal antibodies in various strategies to maximize the 
dose to tumor while limiting normal tissue toxicity. Other known cytotoxic 
radionuclides include Copper-67, Iodine- 131, and Rhenium- 186 all of which can 
be used to label monoclonal antibodies directed against BS 124 antigen for the 
treatment of cancer of the breast. 

EL coli bacteria (clone 1730294) was deposited on June 25, 1997 with the 
American Type Culture Collection (A.T.C.C.), 12301 Parklawn Drive, Rockville, 
Maryland 20852. The deposit was made under the terms of the Budapest Treaty 
and will be maintained for a period of thirty (30) years from the date of deposit, or 
for five (5) years after the last request for the deposit, or for the enforceable period 
of the U.S. patent, whichever is longer. The deposit and any other deposited 
material described herein are provided for convenience only, and are not required 
to practice the present invention in view of the teachings provided herein. The 
cDNA sequence in all of the deposited material is incorporated herein by reference. 
Clone 1730294 was accorded A.T.C.C. Deposit No. 98466. 

The present invention will now be described by way of examples, which 
are meant to illustrate, but not to limit, the scope of the present invention. 
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RXAMPLES 

Exam ple 1 : Identification of Breast Tissue Library BS 1 24 Gene-Specific Clones 

A. Library Comparison of Expressed Sequen ce Tags (EST's) or 
Transcri pt Images . Partial sequences of cDNA clone inserts, so-called "expressed 
sequence tags" (EST's), were derived from cDNA libraries made from breast 
tumor tissues, breast non-tumor tissues and numerous other tissues, both tumor 
and non-tumor and entered into a database (LEFESEQ™ database, available from 
Incyte Pharmaceuticals, Palo Alto, CA) as gene transcript images. See 
International Publication No. WO 95/20681. (A transcript 
image is a listing of the number of EST's for each of the represented genes in a 
given tissue library. EST's sharing regions of mutual sequence overlap are 
classified into clusters. A cluster is assigned a clone number from a representative 
5' EST. Often, a cluster of interest can be extended by comparing its consensus 
sequence with sequences of other EST's which did not meet the criteria for 
automated clustering. The alignment of all available clusters and single EST's 
represent a contig from which a consensus sequence is derived.) The transcript 
images then were evaluated to identify EST sequences that were representative 
primarily of the breast tissue libraries. These target clones then were ranked 
according to their abundance (occurrence) in the target libraries and their absence 
from background libraries. Higher abundance clones with low background 
occurrence were given higher study priority. EST's corresponding to the 
consensus sequence of BS 124 were found in 5.9% (2 of 34) of breast tissue 
libraries. EST's corresponding to the consensus sequence SEQUENCE ID NO 5 
(or fragments thereof) were found in only 0.3% (2 of 640) of the other, non- 
breast, libraries of the data base. Therefore, the consensus sequence or fragment 
thereof was found more than 19 times more often in breast than non-breast 
tissues. Overlapping clones 1730294 (SEQUENCE ID NO 1), 1213903 
(SEQUENCE ID NO 2), g2 185 139 (SEQUENCE ID NO 3), respectively, were 
identified for further study. These represented the minimum number of clones 
which, along with the full-length sequence of clone 1730294 (hereinafter referred 
to as clone 1730294IH, SEQUENCE ID NO 4), were needed to form the contig 
and from which the consensus sequence provided herein (SEQUENCE ID NO 5) 
was derived. 

B. Generation of a Consensus Sequence. The nucleotide sequences of 
clones 1730294 (SEQUENCE ID NO 1), 1213903 (SEQUENCE ID NO 2), 
g 2185139 (SEQUENCE ID NO 3), and 1730294IH (SEQUENCE ID NO 4) were 
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entered in the Sequencher™ Program (available from Gene Codes Corporation, 
Ann Arbor, MI) in order to generate a nucleotide alignment (contig map) and then 
generate their consensus sequence (SEQUENCE ED NO 5). Figures 1 A- IB show 
the nucleotide sequence alignment of these clones and their resultant nucleotide 
5 consensus sequence (SEQUENCE ID NO 5). Figure 2 presents the contig map 
depicting the clones 1730294 (SEQUENCE ID NO 1), 1213903 (SEQUENCE ID 
NO 2), g2 185 139 (SEQUENCE ED NO 3), and 1730294IH (SEQUENCE ED NO 
4), which form overlapping regions of the BS124 gene, and the resultant 
consensus nucleotide sequence (SEQUENCE ED NO 5) of these clones in a 

10 graphic display. Following this, a three-frame translation was performed on the 
consensus sequence (SEQUENCE ID NO 5). The third forward frame was found 
to have an open reading frame encoding a 170 residue amino acid sequence which 
is presented as SEQUENCE ED NO 22. The open reading frame corresponds to 
nucleotides 45-554 of SEQUENCE ED NO 5. A 104 residue BS124 polypeptide 

15 was compared with published sequences using software and techniques known to 
those skilled in the art. The polypeptide sequence of human von Ebner's gland 
protein, a member of the lipocalin superfamily, was found to be partially 
homologous to the 104 residue BS124 polypeptide. The human von Ebner's 
gland protein is described by Blaker et a]., Biochim Biophys Acta 1 172:131-137 

20 (1993), and the sequence therefor has been deposited with GenBank under 
Accession No. L14927. 

Example 2: Sequencing of BS 1 24 EST-Specific Clones 
The DNA sequence of clone 1730294IH (SEQUENCE ID NO 4) of the 
25 BS 124 gene contig was determined using dideoxy termination sequencing with 
dye terminators following known methods [F. Sanger et al., PNAS U.S.A . 
74:5463 (1977)]. 

Because the pINCY vector (available from Incyte Pharmaceuticals, Inc., 
Palo Alto, CA) contains universal priming sites just adjacent to the 3' and 5' 

30 ligation junctions of the inserts, approximately 300 bases of the insert were 

sequenced in both directions using two universal primers (SEQUENCE ED NO 8 
and SEQUENCE ED NO 9, available from New England Biolabs, Beverly, MA, 
and Applied Biosystems Inc, Foster City, CA, respectively). The sequencing 
reactions were run on a poly aery lamide denaturing gel, and the sequences were 

35 determined by an Applied Biosystems 377 Sequencer (available from Applied 
Biosystems, Foster City, CA). Additional sequencing primers (SEQUENCE ED 
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NOS 10-14) were designed from sequence information of the consensus sequence 
(SEQUENCE ID NO 5). These primers then were used to determine the 
remaining DNA sequence of the cloned insert from each DNA strand, as 
previously described. 

5 

Example 3: Nucleic Acid 
A. RNA Extraction from Tissue . Total RNA was isolated from breast 
tissues and from non-breast tissues. Various methods were utilized, including but 
not limited to the lithium chloride/urea technique, known in the art and described 
10 by Kato et al. f J. Virol. 61:2182-2191, 1987), and TRIzol™ (Gibco-BRL, Grand 
Island, NY). 

Briefly, tissue was placed in a sterile conical tube on ice and 10-15 
volumes of 3 M LiCl, 6 M urea, 5 mM EDTA, 0. 1 M (3-mercaptoethanol, 50 mM 
Tris-HCl (pH 7.5) were added. The tissue was homogenized with a Polytron® 

15 homogenizer (Brinkman Instruments, Inc., Westbury, NY) for 30-50 sec on ice. 
The solution was transferred to a 1 5 ml plastic centrifuge tube and placed 
overnight at -20°C. The tube was centrifuged for 90 min at 9,000 x g at 0-4°C and 
the supernatant was immediately decanted. Ten ml of 3 M LiCl were added and 
the tube was vortexed for 5 sec. The tube was centrifuged for 45 min at 1 1 ,000 x 

20 g at 0-4°C. The decanting, resuspension in LiCl, and centrifugation was repeated 
and the final pellet was air dried and suspended in 2 ml of 1 mM EDTA, 0.5% 
SDS, 10 mM Tris (pH 7.5). Twenty microliters (20 \xl) of Proteinase K (20 
mg/ml) were added, and the solution was incubated for 30 min at 37°C with 
occasional mixing. One-tenth volume (0.22-0.25 ml) of 3 M NaCl was added and 

25 the solution was vortexed before transfer into another tube containing 2 ml of 

phenol/chloroform/isoamyl alcohol (PCI). The tube was vortexed for 1-3 sec and 
centrifuged for 20 min at 3,000 x g at 10°C. The PCI extraction was repeated and 
followed by two similar extractions with chloroform/isoamyl alcohol (CI). The 
final aqueous solution was transferred to a prechilled 15 m] Corex glass tube 

30 containing 6 ml of absolute ethanol, the tube was covered with parafilm, and 

placed at -20°C overnight. The tube was centrifuged for 30 min at 10,000 x g at 
0-4°C and the ethanol supernatant was decanted immediately. The RNA pellet was 
washed four times with 10 ml of 75% ice-cold ethanol and the final pellet was air 
dried for 15 min at room temperature. The RNA was suspended in 0.5 ml of 10 

35 mM TE (pH 7.6, 1 mM EDTA) and its concentration was determined 
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spectrophotometrically. RNA samples were aliquoted and stored at -70°C as 
ethanol precipitates. 

The quality of the RNA was determined by agarose gel electrophoresis 
(see Example 5, Northern Blot Analysis) and staining with 0.5 ug/ml ethidium 
bromide for one hour. RNA samples that did not contain intact ribosomal RNAs 
were excluded from the study. 

For RT-PCR analysis, 1 ml of Ultraspec RNA reagent was added to 120 
mg of pulverized tissue in a 2.0 ml polypropylene microfuge tube, homogenized 
with a Polytron® homogenizer (Brinkman Instruments, Inc., Westbury, NY) for 
50 sec and placed on ice for 5 min. Then, 0.2 ml of chloroform was added to 
each sample, followed by vortexing for 15 sec. The sample was placed on ice for 
another 5 min, followed by centrifugation at 12,000 x g for 15 min at 4°C. The 
upper layer was collected and transferred to another RNase-free 2.0 ml microfuge 
tube. An equal volume of isopropanol was added to each sample, and the solution 
was placed on ice for 10 min. The sample was centrifuged at 12,000 x g for 10 
min at 4°C, and the supernatant was discarded. The remaining pellet was washed 
twice with cold 75% ethanol, resuspended by vortexing, and the resuspended 
material was then pelleted by centrifugation at 7500 x g for 5 min at 4°C. Finally, 
the RNA pellet was dried in a Speedvac (Savant, Farmingdale, NY) for 5 min and 
reconstituted in RNase-free water. 

R RNA Extraction from Blnnri Mononu clear Cells. Mononuclear cells 
are isolated from blood samples from patients by centrifugation using Ficoll- 
Hypaque as follows. A 10 ml volume of whole blood is mixed with an equal 
volume of RPMI Medium (Gibco-BRL, Grand Island, NY). This mixture is then 
underlayed with 10 ml of Ficoll-Hypaque (Pharmacia, Piscataway, NJ) and 
centrifuged for 30 minutes at 200 x g. The buffy coat containing the mononuclear 
cells is removed, diluted to 50 ml with Dulbecco's PBS (Gibco-BRL, Grand 
Island, NY) and the mixture centrifuged for 10 minutes at 200 x g. After two 
washes, the resulting pellet is resuspended in Dulbecco's PBS to a final volume of 
1 ml. 

RNA is prepared from the isolated mononuclear cells as described by N. 
Kato et al., JJVirology 61: 2182-2191 (1987). Briefly, the pelleted 
mononuclear cells are brought to a final volume of 1 ml and then are resuspended 
in 250 [iL of PBS and mixed with 2.5 ml of 3 M LiCl, 6 M urea, 5 mM EDTA, 
0.1 M 2-mercaptoethanol, 50 mM Tris-HCl (pH 7.5). The resulting mixture is 
homogenized and incubated at -20°C overnight. The homogenate is centrifuged at 
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8,000 RPM in a Beckman J2-21M rotor for 90 minutes at 0-4°C. The pellet is 
resuspended in 10 ml of 3 M LiCl by vortexing and then centrifuged at 10,000 
RPM in a Beckman J2-21M rotor centrifuge for 45 minutes at 0-4°C. The 
resuspending and pelleting steps then are repeated. The pellet is resuspended in 2 
ml of 1 mM EDTA, 0.5% SDS, 10 mM Tris (pH 7.5) and 400 ug Proteinase K 
with vortexing and then it is incubated at 37°C for 30 minutes with shaking. One 
tenth volume of 3 M NaCl then is added and the mixture is vortexed. Proteins are 
removed by two cycles of extraction with phenol/ chloroform/ isoamyl alcohol 
(PCI) followed by one extraction with chloroform/ isoamyl alcohol (CI). RNA is 
precipitated by the addition of 6 ml of absolute ethanol followed by overnight 
incubation at -20°C. After the precipitated RNA is collected by centrifugation, the 
pellet is washed 4 times in 75% ethanol. The pelleted RNA is then dissolved in 
solution containing 1 mM EDTA, 10 mM Tris-HCl (pH 7.5). 

Non-breast tissues are used as negative controls. The mRNA can be 
further purified from total RNA by using commercially available kits such as oligo 
dT cellulose spin columns (RediCol™ from Pharmacia, Uppsala, Sweden) for the 
isolation of poly-adenylated RNA. Total RNA or mRNA can be dissolved in lysis 
buffer (5 M guanidine thiocyanate, 0.1 M EDTA, pH 7.0) for analysis in the 
ribonuclease protection assay. 

C. RNA Extraction from polysomes . Tissue is minced in saline at 4°C 
and mixed with 2.5 volumes of 0.8 M sucrose in a TK 150 M (150 mM KC1, 5 mM 
MgCl 2 , 50 mM Tris-HCl, pH 7.4) solution containing 6 mM 2-mercaptoethanol. 
The tissue is homogenized in a Teflon-glass Potter homogenizer with five strokes 
at 100-200 rpm followed by six strokes in a Dounce homogenizer, as described by 
B. Mechler, Methods in Enzvmologv 152:241-248 (1987). The homogenate then 
is centrifuged at 12,000 x g for 15 min at 4°C to sediment the nuclei. The 
polysomes are isolated by mixing 2 ml of the supernatant with 6 ml of 2.5 M 
sucrose in TK 150 M and layering this mixture over 4 ml of 2.5 M sucrose in 
TK 150 M in a 38 ml polyallomer tube. Two additional sucrose TK 150 M solutions 
are successively layered onto the extract fraction; a first layer of 13 ml 2.05 M 
sucrose followed by a second layer of 6 ml of 1 .3 M sucrose. The polysomes are 
isolated by centrifuging the gradient at 90,000 x g for 5 hr at 4°C. The fraction 
then is taken from the 1.3 M sucrose/2.05 M sucrose interface with a siliconized 
pasteur pipette and diluted in an equal volume of TE (10 mM Tris-HCl, pH 7.4, 1 
mM EDTA). An equal volume of 90°C SDS buffer (1% SDS, 200 mM NaCl, 20 
mM Tris-HCl, pH 7.4) is added and the solution is incubated in a boiling water 



Atty DktNo. 6120.US.P1 
PATENT 



-62- 

bath for 2 min. Proteins next are digested with a Proteinase-K digestion (50 
mg/ml) for 15 min at 37°C. The mRNA is purified with 3 equal volumes of 
phenol-chloroform extractions followed by precipitation with 0.1 volume of 2 M 
sodium acetate (pH 5.2) and 2 volumes of 100% ethanol at -20°C overnight. The 
precipitated RNA is recovered by centrifugation at 12,000 x g for 10 min at 4°C. 
The RNA is dried and resuspended in TE (pH 7.4) or distilled water. The 
resuspended RNA then can be used in a slot blot or dot blot hybridization assay to 
check for the presence of BS 124 mRNA (see Example 6). 

The quality of nucleic acid and proteins is dependent on the method of 
preparation used. Each sample may require a different preparation technique to 
maximize isolation efficiency of the target molecule. These preparation techniques 
are within the skill of the ordinary artisan. 

Example 4: Ribonuclease Protection Assay 
A Synthesis of Labeled Complementary RN A (cKNA) Hybridization 
Probe and Unlabeled Sense Strand. Labeled antisense and unlabeled sense 
riboprobes were transcribed from the BS124 gene cDNA sequence flanked by 
opposed SP6 and T7 RNA polymerase promoters. The sequence was from a 
pINCY vector (Incyte Pharmaceuticals, Inc. Palo Alto, CA) containing the 
appropriate BS124 cDNA insert. The described plasmid, clone 1730294, which 
contains the BS 124 gene cDNA sequence flanked by opposed SP6 and T7 RNA 
polymerase promoters, was purified using a Qiagen Plasmid Purification Kit 
(Qiagen, Chatsworth, CA). Next, 10 Jig of the plasmid DNA were linearized by 
cutting with restriction enzymes Ddel and HindHI for 1 hr at 37°C. The linearized 
plasmid DNA was purified using the QIAquick Kit (Qiagen, Chatsworth, CA) and 
250 ng were used for the synthesis of antisense transcript from the T7 promoter 
using the Riboprobe® in vitro Transcription System (Promega Corporation, 
Madison, WI) as described by the manufacturer, incorporating 10 |iM (alpha 32 P) 
CTP (Amersham Life Sciences, Inc. Arlington Heights, IL) as a label. To 
generate the sense strand, 10 |Xg of the purified plasmid DNA were cut with 
restriction enzymes NotI and Sail, and 250 ng transcribed as above from the SP6 
promoter except with 500 uM unlabeled CTP. Both sense and antisense strands 
were purified by spin column chromatography. Unlabeled sense strand was 
quantitated by UV absorption at 260 nm. 

R. Hybridization of Labeled Probe to Target . Frozen tissue was 
pulverized to powder under liquid nitrogen, and 100-500 mg of the powder was 
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dissolved in an appropriate volume of lysis buffer, available as a component of the 
Qiagen RNeasy Total RNA Kit (Qiagen, Chatsworth, CA). Further dissolution 
was achieved using a tissue homogenizer. RNA was then isolated as described by 
the supplier's instructions. In addition, a dilution series of a known amount of 
5 sense strand in yeast RNA (Qiagen, Chatsworth, CA) was made for use as a 
positive control. Finally, 2-5 fig of tissue RNA or diluted sense strand were 
mixed directly with 5 x 10 4 cpm of radioactively labeled probe, precipitated with 
ammonium acetate and ethanol, and suspended in 20 jil of a hybridization buffer 
which is a component of the RPA n™ Ribonuclease Protection Assay Kit 
10 (Ambion, Inc., Austin, TX). Hybridization was allowed to proceed overnight at 
45°C. See, J.J. Lee et al., Meth. Enzvmol . 152:633-648 (1987). 

C. RNase Digestion . RNA that was not hybridized to probe was removed 
from the reaction as per the RPA II™ protocol using a solution of RNase A and 
RNase Tl for 30 min at 37°C. Hybridized fragments protected from digestion 

1 5 were then precipitated as described by the manufacturer. The precipitates were 
collected by centrifugation at 12,000 x g for 20 min. 

D. Fragment Analysis . The precipitates were dissolved in denaturing gel 
loading dye (80% formamide, 10 raM EDTA (pH 8.0), 1 mg/ml xylene cyanol, 1 
mg/ml bromophenol blue), heat denatured, and electrophoresed in 6% 

20 polyacrylamide TBE, 8 M urea denaturing gels. The gels were imaged and 
analyzed using the STORM™ storage phosphor autoradiography system 
(Molecular Dynamics, Sunnyvale, CA). Quantitation of protected fragment bands 
was achieved by comparing the peak areas obtained from the test samples to those 
from the known dilutions of the positive control sense strand (see Section B, 

25 supra ). The results were expressed as an image rating score (Table 1). 
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Table 1 . Ribonuclease Protection Results 
Tissue Score 



Normal Breast 


1 + 


Normal Breast 


2+ 


Normal Breast 


- 


Normal Breast 


2+ 


Normal Breast 


1 + 


Breast Cancer 


3+ 


Breast Cancer 


4+ 


Breast Cancer 




Breast Cancer 


2+ 


Breast Cancer 




Breast Cancer 




Normal Lung 




Lung Cancer 




Normal Colon 




Colon Cancer 





5 Detection of a product comprising a sequence selected from the group 

consisting of SEQUENCE ID NO 1, SEQUENCE ID NO 2, SEQUENCE ID NO 
3, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and fragments or complements 
thereof, is indicative of the presence of BS124 mRNAs, suggesting a diagnosis of 
a breast tissue disease or condition, such as breast cancer. 

10 

Example 5: Northern Blottine 
The Northern blot technique is used to identify a specific size RNA species 
in a complex population of RNA using agarose gel electrophoresis and nucleic 
acid hybridization. Briefly, 5-10 ug of total RNA (see Example 3, Nucleic Acid 
1 5 Preparation) were incubated in 1 5 |ll of a solution containing 40 mM 

morphilinopropanesulfonic acid (MOPS) (pH 7.0), 10 mM sodium acetate, 1 mM 
EDTA, 2.2 M formaldehyde, 50% v/v formamide for 15 min at 65°C. The 
denatured RNA was mixed with 2 |il of loading buffer (50% glycerol, 1 mM 
EDTA, 0.4% bromophenol blue, 0.4% xylene cyanol) and loaded into a 
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denaturing 1.0% agarose gel containing 40 mM MOPS (pH 7.0), 10 mM sodium 
acetate, 1 mM EDTA and 2.2 M formaldehyde. The gel was electrophoresed at 60 
V for 1.5 hr, stained with 0.5 |lg/ml ethidium bromide for one hour and rinsed in 
RNase free water for 30-45 min. RNA was transferred from the gel onto nylon 
5 membranes (Brightstar-Plus, Ambion, Inc., Austin, TX) for 1.5 hours using the 
downward alkaline capillary transfer method (Chomczynski, Anal. Biochem. 
201:134-139, 1992). The filter was rinsed with IX SSC and RNA was 
crosslinked to the filter using a Stratalinker™ (Stratagene, Inc., La Jolla, CA) on 
the autocrosslinking mode and dried for 15 min. The membrane was then placed 

10 into a hybridization tube containing 20 ml of preheated prehybridization solution 
(5X SSC, 50% formamide, 5X Denhardt's solution, 100 |Xg/ml denatured salmon 
sperm DNA) and incubated in a 42°C hybridization oven for at least 3 hr. While 
the blot was prehybridizing, a 32 P-labeled random-primed probe was generated 
using the BS124 insert according to the manufacturer's instructions (Gibco-BRL, 

1 5 Grand Island, NY). Half of the probe was boiled for 10 min, quick chilled on ice 
and added to the hybridization tube. Hybridization was carried out at 42°C for at 
least 12 hr. The hybridization solution was discarded and the filter was washed 
twice in 30 ml of 3X SSC, 0. 1 % SDS at 42°C for 15 min, followed by two 
washes in 30 ml of 0.3X SSC, 0. 1% SDS at 60°C for 15 min. each. The filter 

20 was wrapped in Saran Wrap and exposed to Kodak XAR-Omat film for 8-1 20 hr 
and the film was developed for analysis. 

The results of an analysis of BS 124 hybridization to a Northern blot » 
containing a panel of various normal tissues are shown in Figure 3A which 
contains an ethidium bromide (EtBr)-stained RNA gel and a corresponding BS124 

25 Northern blot. The positions of RNA size standards (in Kb) are shown to the left 
of each panel. The blot was exposed for 196 hours but the positive control (lane 
12) was from a 24 hour exposure. As shown in Figure 3 A, the BS124-specific 
probe detected an approximately 0.5 kb RNA in normal testis (lane 10). Lane 12, 
which contains the positive control of RNA isolated from E. coli containing a 

30 BS 1 24 plasmid, was also positive for the BS 1 24 hybridization. 

The results of an analysis of BS 124 hybridization to a Northern blot 
containing various breast tissue extracts (normal breast and breast cancer tissues) 
are shown in Figure 3B which contains an ethidium bromide (EtBr)-stained RNA 
gel and the corresponding BS 124 Northern blot. The positions of RNA size 

35 standards (in kb) are shown to the left of each panel. The blot was exposed for 
196 hours but the positive control (lane 12) was from a 48 hour exposure. As 
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shown in Figure 3B, the BS124-specific probe detected an approximately 0.5 kb 
RNA in two of six breast cancer samples (lanes 10 and 1 1) but not in any of the 
five normal breast samples (lanes 1 - 5). Lane 12, which contains the positive 
control of RNA isolated from E. coli containing a BS 124 plasmid, was also 
5 positive for the BS 1 24 hybridization. 

Detection of a product comprising a sequence selected from the group 
consisting of SEQUENCE ID NO 1, SEQUENCE ID NO 2, SEQUENCE ID NO 
3, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and fragments or complements 
thereof, is indicative of the presence of BS124 mRNAs, suggesting a diagnosis of 
1 0 a breast tissue disease or condition, such as breast cancer. 

Example 6: Dot Blot/Slot Blot 
Dot and slot blot assays are quick methods to evaluate the presence of a 
specific nucleic acid sequence in a complex mix of nucleic acid. To perform such 

15 assays, up to 50 [ig of RNA is mixed in 50 jxl of 50% formamide, 7% 

formaldehyde, IX SSC, incubated 15 min at 68°C, and then cooled on ice. Then, 
100 yd of 20X SSC is added to the RNA mixture and loaded under vacuum onto a 
manifold apparatus that has a prepared nitrocellulose or nylon membrane. The 
membrane is soaked in water, 20X SSC for 1 hour, placed on two sheets of 20X 

20 SSC prewet Whatman #3 filter paper, and loaded into a slot blot or dot blot 

vacuum manifold apparatus. The slot blot is analyzed with probes prepared and 
labeled as described in Example 4, supra . Detection of mRNA corresponding to a 
sequence selected from the group consisting of SEQUENCE ID NO 1, 
SEQUENCE ID NO 2, SEQUENCE ID NO 3, SEQUENCE ID NO 4, 

25 SEQUENCE ID NO 5, and fragments or complements thereof, is an indication of 
the presence of BS 1 24, suggesting a diagnosis of a breast tissue disease or 
condition, such as breast cancer. 

Other methods and buffers which can be utilized in the methods described 
in Examples 5 and 6, but not specifically detailed herein, are known in the art and 

30 are described in J. Sambrook et al.. supra which is incorporated herein by 
reference. 



35 



Example 7: In Situ Hybridization 
This method is useful to directly detect specific target nucleic acid 
sequences in cells using detectable nucleic acid hybridization probes. 
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Tissues are prepared with cross-linking fixative agents such as 
paraformaldehyde or glutaraldehyde for maximum cellular RNA retention. See, 
L. Angerer et al., Methods in Cell Biol . 35:37-71 (1991). Briefly, the tissue is 
placed in greater than 5 volumes of 1 % glutaraldehyde in 50 mM sodium 
5 phosphate, pH 7.5 at 4°C for 30 min. The solution is changed with fresh 

glutaraldehyde solution (1% glutaraldehyde in 50 mM sodium phosphate, pH 7.5) 
for a further 30 min fixing. The fixing solution should have an osmolality of 
approximately 0.375% NaCl. The tissue is washed once in isotonic NaCl to 
remove the phosphate. 

10 The fixed tissues then are embedded in paraffin as follows. The tissue is 

dehydrated though a series of ethanol concentrations for 15 min each: 50% 
(twice), 70% (twice), 85%, 90% and then 100% (twice). Next, the tissue is 
soaked in two changes of xylene for 20 min each at room temperature. The tissue 
is then soaked in two changes of a 1 : 1 mixture of xylene and paraffin for 20 min 

15 each at 60°C; and then in three final changes of paraffin for 15 min each. 

Next, the tissue is cut in 5 UMn sections using a standard microtome and 
placed on a slide previously treated with a tissue adhesive such as 3- 
aminopropyltriethoxysilane. 

Paraffin is removed from the tissue by two 10 min xylene soaks and 

20 rehydrated in a series of ethanol concentrations: 99% (twice), 95%, 85%, 70%, 
50%, 30%; and then in distilled water (twice). The sections are pre-treated with 
0.2 M HC1 for 10 min and permeabilized with 2 |lg/ml Proteinase-K at 37°C for 
15 min. 

Labeled riboprobes transcribed from the BS124 gene plasmid (see 
25 Example 4) are hybridized to the prepared tissue sections and incubated overnight 
at 56°C in 3X standard saline extract and 50% formamide. Excess probe is 
removed by washing in 2X standard saline citrate and 50% formamide followed 
by digestion with 100 |!g/ml RNase A at 37°C for 30 min. Probe fluorescence is 
visualized by illumination with ultraviolet (UV) light under a microscope. 
30 Fluorescence in the cytoplasm is indicative of BS 124 mRNA. Alternatively, the 
sections can be visualized by autoradiography. 



Example 8: Reverse Transcription PCR 
A. One Step RT-PCR Assay . Target-specific primers are designed to 
35 detect the above-described target sequences by reverse transcription PCR using 
methods known in the art. One step RT-PCR is a sequential procedure that 



Atty Dkt No. 6120.US.P1 
PATENT 



-68- 

performs both RT and PCR in a single reaction mixture. The procedure is 
performed in a 200 reaction mixture containing 50 mM (N,N,-bis[2- 
Hydroxyethyl]glycine), pH 8.15, 81.7 mM KOAc, 33.33 mM KOH, 0.01 mg/ml 
bovine serum albumin, 0. 1 mM ethylene diaminetetraacetic acid, 0.02 mg/ml 
5 NaN 3 , 8% w/v glycerol, 150 p:M each of dNTP, 0.25 (iM each primer, 5U rTth 

polymerase, 3.25 mM Mn(OAc) 2 and 5 |ll of target RNA (see Example 3). Since 
RNA and the rTth polymerase enzyme are unstable in the presence of Mn(OAc) 2 , 
the Mn(OAc) 2 should be added just before target addition. Optimal conditions for 
cDNA synthesis and thermal cycling readily can be determined by those skilled in 

10 the art. The reaction is incubated in a Perkin-Elmer Thermal Cycler 480. Optimal 
conditions for cDNA synthesis and thermal cycling can readily be determined by 
those skilled in the art. Conditions which may be found useful include cDNA 
synthesis at 60°-70°C for 15-45 min and 30-45 amplification cycles at 94°C, 1 min; 
55°-70°C, 1 min; 72°C, 2 min. One step RT-PCR also may be performed by using 

15 a dual enzyme procedure with Taq polymerase and a reverse transcriptase enzyme, 
such as MMLV or AMV RT enzymes. 

B. Traditional RT-PCR . A traditional two-step RT-PCR reaction was 
performed, as described by K.Q. Hu et al., Virology 181:721-726 (1991). 
Briefly, 0.5 \ig of extracted mRNA (see Example 3) was reverse transcribed in a 

20 20 (0.1 reaction mixture containing IX PCR II buffer (Perkin-Elmer), 5 mM MgCl 2 , 
1 mM each dNTP, 20 U RNasin, 2.5 U.M random hexamers, and 50 U MMLV 
(Moloney murine leukemia virus) reverse transcriptase (RT). Reverse 
transcription was performed at room temperature for 10 min, 42°C for 30 min in a 
PE-480 thermal cycler (Perkin-Elmer), followed by further incubation at 95°C for 

25 5 min to inactivate the RT. PCR was performed using 2 jj.1 of the cDNA reaction 
in a final PCR reaction volume of 50 (ll containing IX PCR II buffer (Perkin- 
Elmer), 50 mM KC1, 1.5 mM MgCl 2 , 200 |iM dNTPs, 0.5 uM each sense and 
antisense primers (SEQUENCE ID NO 15 and SEQUENCE ID NO 16, 
respectively), and 2.5 U of Taq polymerase. The reaction was incubated in a PE- 

30 480 thermal cycler (Perkin-Elmer), as follows: 35 cycles of amplification (94°C, 
45 sec; 62°C, 45 sec; 70°C, 2 min.); a final extension (72°C, 7 min); and a soak at 
4°C. 

C. PCR Fragment Analysis . The correct products were verified by size 
determination using gel electrophoresis with a SYBR® Green I nucleic acid gel 

35 stain (Molecular Probes, Eugene, OR). Gels were stained with SYBR® Green I at 
a 1 : 10,000 dilution in IX TBE for 45 min. The gels, which are presented in 
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Figures 4A and 4B, were imaged using a STORM™ imaging system (Molecular 
Dynamics, Sunnyvale, CA). Figure 4A shows a 100 bp BS124-specific PCR 
amplification product in lanes 3 through 11, indicating that BS124 mRNA was 
present in 9 of 10 normal and breast cancer samples tested. In Figure 4B, the 100 
5 bp BS124-specific PCR amplification product is observed (faintly) in only one 
normal lung sample (lane 6), suggesting that BS124 is breast specific. 

Detection of a product comprising a sequence selected from the group 
consisting of SEQUENCE ID NO 1, SEQUENCE ID NO 2, SEQUENCE ED NO 3, 
SEQUENCE ED NO 4, SEQUENCE ED NO 5, and fragments or complements 
10 thereof, is indicative of the presence of BS 124 mRNAs, suggesting a diagnosis of a 
breast tissue disease or condition, such as breast cancer. 

Example 9: OH-PCR 

A. Probe selection and Labeling . Target-specific primers (SEQUENCE 
15 ED NO 1 7 and SEQUENCE ID NO 18) and a probe (SEQUENCE ED NO 1 9) 

were designed to detect the above-described target sequences by oligonucleotide 
hybridization PCR. International Publication Nos WO 92/10505, published June 
25, 1992, and WO 92/1 1388, published July 9, 1992, teach methods for labeling 
oligonucleotides at their 5' and 3' ends, respectively. The label-phosphoramidite 

20 reagent was prepared and used to add the label to the oligonucleotide during its 

synthesis. [See N. T. Thuong et al., Tet. Letters 29(46): 5905-5908 (1988); J. S. 
Cohen et al., published U.S. Patent Application 07/246,688 (NTIS ORDER No. 
PAT-APPL-7-246,688) (1989)]. Probes were labeled at their 3' end to prevent 
participation in PCR and the formation of undesired extension products. For one- 

25 step OH-PCR, the probe had a T M at least 15°C below the T M of the primers. The 
primers and probes were utilized as specific binding members, with or without 
detectable labels, using standard phosphoramidite chemistry and/or post-synthetic 
labeling methods which are well-known to one skilled in the art. 

B. One Step Oligo Hybridization PCR . OH-PCR was performed on a 
30 200 |il reaction containing 50 mM (N,N,-bis[2-Hydroxyethyl]glycine), pH 8.15, 

81.7 mM KOAc, 33.33 mM KOH, 0.01 mg/ml bovine serum albumin, 0.1 mM 
ethylene diaminetetraacetic acid, 0.02 mg/ml NaN 3s 8% w/v glycerol, 150 (iM 

each of dNTP, 0.25 jaM each primer, 5.0 nM probe, 5U rTth polymerase, 3.25 
mM Mn(OAc) 2 and target RNA (see Example 3). Since RNA and the rTth 
35 polymerase enzyme are unstable in the presence of Mn(OAc) 2 , the Mn(OAc) 2 was 
added just before target addition. The reaction was incubated in a Perkin-Elmer 
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Thermal Cycler 480. Optimal conditions for cDNA synthesis and thermal cycling 
can be readily determined by those skilled in the art. Conditions which were 
found useful included cDNA synthesis (60°C, 30 min) and 30-45 amplification 
cycles (94°C, 40 sec; 62°C, 60 sec). Before oligo hybridization, the PCR 
5 products were visualized on a 1% agarose gel stained with SYBR® Green (Figure 
5A). Oligo hybridization was performed subsequent to amplification (97°C, 5 
min; 12°C, 5 min; 1 2°C soak). The correct reaction product contained at least one 
of the strands of the PCR product and an internally hybridized probe. 

Figure 5A is a scan of a SYBR® Green-stained agarose gel of BS124 

10 RNA-specific PCR amplification products. This figure shows a 2.25 Kb BS124- 
specific amplicon with BS124 transcript RNA (lane 2), with T-47D cell undiluted 
(neat) RNA (lane 3), and with T-47D cell RNA diluted 1 to 10 (lane 4). The T- 
47D cell line originated from a human breast carcinoma, (A.T.C.C., HTB133, 
Rockville, MD). Lane 1 shows a reagent negative control (water), while lane 5 is 

1 5 a placental DNA negative control. The human placental DNA control did not 
show a 2.25 Kb amplicon suggesting that the 2.25 Kb amplicons observed in 
lanes 2-4 were the result of the amplification of mRNA and not DNA. The reagent 
control did not yield any amplicon indicating the OH-PCR reagents themselves did 
not amplify. 

20 C. OH-PCR Product Analysis . Amplified reaction products were detected 

on an LCx® Analyzer system (available from Abbott Laboratories, Abbott Park, 
IL). Briefly, the specific reaction product was captured by an antibody labeled 
microparticle at a capturable site on the hybridization probe and the complex was 
detected by binding of a detectable antibody conjugate to the PCR product. Only 

25 the complex containing a PCR strand hybridized with the internal probe was 
detectable (Figure 5B). Figure 5B shows the detectable OH-PCR amplified 
products from the same samples but analyzed on an LCx® system. BS124 RNA- 
specific transcript was diluted to determine the number of molecules that could be 
detected using the LCx® Analyzer. Ten BS124 transcript molecules were detected 

30 when 40 PCR amplification cycles were used. In addition, undiluted RNA 
isolated from T-47D cells, a human breast cell line, was detected at 45 PCR 
amplification cycles. The detection of this complex, then, is indicative of the 
presence of BS124 mRNA, suggesting a diagnosis of a breast disease or 
condition, such as breast cancer. 

35 Many other detection formats exist which can be used and/or modified by 

those skilled in the art to detect the presence of amplified or non-amplified BS124- 
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derived nucleic acid sequences including, but not limited to, ligase chain reaction 
(LCR, Abbott Laboratories, Abbott Park, IL); Q-beta replicase (Gene-Trak™, 
Naperville, Illinois), branched chain reaction (Chiron, Emeryville, CA) and strand 
displacement assays (Becton Dickinson, Research Triangle Park, NC). 

5 

Example 10: Synthetic Peptide Production 
Synthetic peptides were modeled and then prepared based upon the 
predicted amino acid sequence of the BS124 polypeptide consensus sequence (see 
Example 1 ). In particular, a number of BS 124 peptides derived from the BS 124 

10 polypeptide sequence of SEQUENCE ID NO 22 were prepared, including the 

peptides of SEQUENCE ID NO 23, SEQUENCE ID NO 24, and SEQUENCE ED 
NO 25. All peptides were synthesized on a Symphony Peptide Synthesizer 
(available from Rainin Instrument Co, Emeryville, CA) or similar instrument, 
using FMOC chemistry, standard cycles and in-situ HBTU activation. Cleavage 

15 and deprotection conditions were as follows: a volume of 2.5 ml of cleavage 

reagent (77.5% v/v trifluoroacetic acid, 15% v/v ethanedithiol, 2.5% v/v water, 
5% v/v thioanisole, 1-2% w/v phenol) was added to the resin, and agitated at 
room temperature for 2-4 hours. The filtrate was then removed and the peptide 
was precipitated from the cleavage reagent with cold diethyl ether. Each peptide 

20 was filtered, purified via reverse-phase preparative HPLC using a 

water/acetonitrile/0. 1 % TFA gradient, and lyophilized. The product was 
confirmed by mass spectrometry. 

The purified peptides were used to immunize animals (see Example 14). 

25 Example 1 la: Expression of Protein in a Cell Line Using Plasmid 577 

A. Construction of a BS124 Expression Plasmid . Plasmid 577, described 
in U.S. patent application Serial No. 08/478,073, filed June 7, 1995 and 
incorporated herein by reference, has been constructed for the expression of 
secreted antigens in a permanent cell line. This plasmid contains the following 

30 DNA segments: (a) a 2.3 Kb fragment of pBR322 containing bacterial beta- 
lactamase and origin of DNA replication; (b) a 1.8 Kb cassette directing 
expression of a neomycin resistance gene under control of HSV-1 thymidine 
kinase promoter and poly-A addition signals; (c) a 1 .9 Kb cassette directing 
expression of a dihydrofolate reductase gene under the control of an SV-40 

35 promoter and poly-A addition signals; (d) a 3.5 Kb cassette directing expression 
of a rabbit immunoglobulin heavy chain signal sequence fused to a modified 
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hepatitis C virus (HCV) E2 protein under the control of the Simian Virus 40 T-Ag 
promoter and transcription enhancer, the hepatitis B virus surface antigen 
(HBsAg) enhancer I followed by a fragment of Herpes Simplex Virus- 1 (HSV-1) 
genome providing poly-A addition signals; and (e) a residual 0.7 Kb fragment of 
5 Simian Virus 40 genome late region of no function in this plasmid. All of the 
segments of the vector were assembled by standard methods known to those 
skilled in the art of molecular biology. 

Plasmids for the expression of secretable BS124 proteins are constructed 
by replacing the hepatitis C virus E2 protein coding sequence in plasmid 577 with 

10 a BS 124 polynucleotide sequence selected from the group consisting of 
SEQUENCE ID NO 1, SEQUENCE ID NO 2, SEQUENCE ID NO 3, 
SEQUENCE ID NO 4, SEQUENCE ID NO 5, and fragments or complements 
thereof, as follows. Digestion of plasmid 577 with Xbal releases the hepatitis C 
virus E2 gene fragment. The resulting plasmid backbone allows insertion of the 

15 BS124 cDNA insert downstream of the rabbit immunoglobulin heavy chain signal 
sequence which directs the expressed proteins into the secretory pathway of the 
cell. The BS 124 cDNA fragment is generated by PCR using standard procedures. 
Encoded in the sense PCR primer sequence is an Xbal site, immediately followed 
by a 12 nucleotide sequence that encodes the amino acid sequence Ser-Asn-Glu- 

20 Leu ("SNEL") to promote signal protease processing, efficient secretion and final 
product stability in culture fluids. Immediately following this 12 nucleotide 
sequence, the primer contains nucleotides complementary to template sequences 
encoding amino acids of the BS124 gene. The antisense primer incorporates a 
sequence encoding the following eight amino acids just before the stop codons: 

25 Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys (SEQUENCE ID NO 26). Within this 

sequence is incorporated a recognition site to aid in analysis and purification of the 
BS124 protein product. A recognition site (termed "FLAG") that is recognized by 
a commercially available monoclonal antibody designated anti-FLAG M2 
(Eastman Kodak, Co., New Haven, CT) can be utilized, as well as other 

30 comparable sequences and their corresponding antibodies. For example, PCR is 
performed using GeneAmp® reagents obtained from Perkin-Elmer-Cetus, as 
directed by the supplier's instructions. PCR primers are used at a final 
concentration of 0.5 uM. PCR is performed on the BS124 plasmid template in a 
100 u.1 reaction for 35 cycles (94°C, 30 seconds; 55°C, 30 seconds; 72°C, 90 

35 seconds) followed by an extension cycle of 72°C for 10 min. 
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B. Transfection of Dihvdrofolate Reductase Deficient Chinese Hamster 
Ovary Cells . The plasmid described supra is transfected into CHO/dhfr- cells 
(DXB-1 1 1, Uriacio et al., PNAS 77:4451-4466 (1980)). These cells are available 
from the A.T.C.C., 12301 Parklawn Drive, Rockville, MD 20852, under 

5 Accession No. CRL 9096. Transfection is carried out using the cationic 

liposome-mediated procedure described by P. L. Feigner et al., PNAS 84:741 3- 
7417 (1987). Particularly, CHO/dhfr- cells are cultured in Ham's F-12 media 
supplemented with 10% fetal calf serum, L-glutamine (1 mM) and freshly seeded 
into a flask at a density of 5-8 x 10 5 cells per flask. The cells are grown to a 

10 confluency of between 60 and 80% for transfection. Twenty micrograms (20|ig) 
of plasmid DNA are added to 1 .5 ml of Opti-MEM I medium and 100 of 
Lipofectin Reagent (Gibco-BRL; Grand Island, NY) are added to a second 1.5 ml 
portion of Opti-MEM I media. The two solutions are mixed and incubated at room 
temperature for 20 min. After the culture medium is removed from the cells, the 

15 cells are rinsed 3 times with 5 ml of Opti-MEM I medium. The Opti-MEM I- 
Lipofection-plasmid DNA solution then is overlaid onto the cells. The cells are 
incubated for 3 hr at 37°C, after which time the Opti-MEM I-Lipofectin-DNA 
solution is replaced with culture medium for an additional 24 hr prior to selection. 

C. Selection and Amplification . One day after transfection, cells are 
20 passaged 1:3 and incubated with dhfr/G418 selection medium (hereafter, "F-12 

minus medium G"). Selection medium is Ham's F-12 with L-glutamine and 
without hypoxanthine, thymidine and glycine (JRH Biosciences, Lenexa, Kansas) 
and 300 |lg per ml G41 8 (Gibco-BRL, Grand Island, NY). Media volume-to- 
surface area ratios of 5 ml per 25 cm 2 are maintained. After approximately two 

25 weeks, DHFR/G418 cells are expanded to allow passage and continuous 
maintenance in F-12 minus medium G. 

Amplification of each of the transfected BS124 cDNA sequences is 
achieved by stepwise selection of DHFR + , G41 8 + cells with methotrexate 
(reviewed by R. Schimke, Cell 37:705-713 [1984]). Cells are incubated with F- 

30 12 minus medium G containing 150 nM methotrexate (MTX) (Sigma, St. Louis, 
MO) for approximately two weeks until resistant colonies appear. Further gene 
amplification is achieved by selection of 150 nM adapted cells with 5 u,M MTX. 

D. Antigen Production . F-12 minus medium G supplemented with 5 \xM 
MTX is overlaid onto just confluent monolayers for 12 to 24 hr at 37°C in 5% 

35 C0 2 . The growth medium is removed and the cells are rinsed 3 times with 
Dulbecco's phosphate buffered saline (PBS) (with calcium and magnesium) 
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(Gibco-BRL; Grand Island, NY) to remove the remaining media/serum which 
may be present. Cells then are incubated with VAS custom medium (VAS custom 
formulation with L-glutamine with HEPES without phenol red, available from 
JRH Bioscience; Lenexa, KS, product number 52-08678P), for 1 hr at 37°C in 
5 5% C0 2 . Cells then are overlaid with VAS for production at 5 ml per T flask. 
Medium is removed after seven days of incubation, retained, and then frozen to 
await purification with harvests 2, 3 and 4. The monolayers are overlaid with 
VAS for 3 more seven day harvests. 

E. Analysis of Breast Tissue Gene BS 124 Antigen Expression . Aliquots 
10 of VAS supernatants from the cells expressing the BS 124 protein construct are 

analyzed, either by SDS-polyacrylamide gel electrophoresis (SDS-PAGE) using 
standard methods and reagents known in the art (Laemmli discontinuous gels), or 
by mass spectrometry. 

F. Purification. Purification of the BS124 protein containing the FLAG 
15 sequence is performed by immunoaffinity chromatography using an affinity matrix 

comprising anti-FLAG M2 monoclonal antibody covalently attached to agarose by 
hydrazide linkage (Eastman Kodak Co., New Haven, CT). Prior to affinity 
purification, protein in pooled VAS medium harvests from roller bottles is 
exchanged into 50 mM Tris-HCl (pH 7.5), 150 mM NaCl buffer using a 

20 Sephadex G-25 (Pharmacia Biotech Inc., Uppsala, Sweden) column. Protein in 
this buffer is applied to the anti-FLAG M2 antibody affinity column. Non-binding 
protein is eluted by washing the column with 50 mM Tris-HCl (pH 7.5), 150 mM 
NaCl buffer. Bound protein is eluted using an excess of FLAG peptide in 50 mM 
Tris-HCl (pH 7.5), 150 mM NaCl. The excess FLAG peptide can be removed 

25 from the purified BS 124 protein by gel electrophoresis or HPLC. 

Although plasmid 577 is utilized in this example, it is known to those 
skilled in the art that other comparable expression systems, such as CMV, can be 
utilized herein with appropriate modifications in reagent and/or techniques and are 
within the skill of the ordinary artisan. 

30 The largest cloned insert containing the coding region of the BS 124 gene is 

then sub-cloned into either (i) a eukaryotic expression vector which may contain, 
for example, a cytomegalovirus (CMV) promoter and/or protein fusible sequences 
which aid in protein expression and detection, or (ii) a bacterial expression vector 
containing a superoxide-dismutase (SOD) and CMP-KDO synthetase (CKS) or 

35 other protein fusion gene for expression of the protein sequence. Methods and 
vectors which are useful for the production of polypeptides which contain fusion 
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sequences of SOD are described in EPO 0196056, published October 1 , 1986, 
which is incorporated herein by reference and those containing fusion sequences 
of CKS are described in EPO Publication No. 0331961, published September 13, 
1989, which publication is also incorporated herein by reference. This so-purified 
5 protein can be used in a variety of techniques, including, but not limited to animal 
immunization studies, solid phase immunoassays, etc. 

Example 1 lb: Expression of Protein in a Cell Line Using pcDNA3. 1/Myc-His 
A. Construction of a BS 1 24 Expression Plasmid . Plasmid 

10 pcDNA3. 1/Myc-His (Cat.# V855-20, Invitrogen, Carlsbad, CA) was developed 
for the expression of secreted antigens by most mammalian cell lines. Expressed 
protein inserts are fused to a myc-his peptide tag. The myc-his tag is a 21 residue 
amino acid sequence having the following sequence: Glu-Gln-Lys-Leu-Ile-Ser- 
Glu- Glu-Asp-Leu-Asn-Met-His-Thr-Glu-His-His-His-His-His-His 

15 (SEQUENCE ID NO 27) and comprises a c-myc oncoprotein epitope and a 

polyhistidine sequence which are useful for the purification of an expressed fusion 
protein by using either anti-myc or anti-his affinity columns, or metal loprotein 
binding columns. 

A plasmid for the expression of secretable BS124 proteins was constructed 

20 by inserting a BS 124 polynucleotide sequence from clone 1730294 (SEQUENCE 
ID NO 1) into the pcDNA3. 1/Myc/His vector. (This plasmid will be referred to as 
pcl730294-M/H.) Prior to construction of pcl730294-M/H, the BS124 cDNA 
sequence was first cloned into a pCR®-Blunt vector as follows. 

The BS124 cDNA fragment was generated by PCR using standard 

25 procedures and using reagents obtained from Stratagene®, Inc. (La Jolla, CA) as 
directed by the manufacturer. PCR primers were used at a final concentration of 
0.5 (LtM. PCR using 5 U of pfu polymerase (Stratagene®, La Jolla, CA) was 
performed on the BS 124 plasmid template (see Example 2) in a 50 u.1 reaction for 
30 cycles (94°C, 1 min; 65°C, 1.5 min; 72°C, 3 min) followed by an extension 

30 cycle of 72°C for 10 min. The sense PCR primer sequence (SEQUENCE ID NO 
20) is identical to that found directly upstream of the BS124 gene insertion site in 
the pINCY vector. The antisense primer sequence, (SEQUENCE ID NO 21), 
incorporated a 5' NotI restriction sequence and a sequence complementary to the 
3' end of the BS124 cDNA insert just upstream of the 3'-most, in-frame stop 

35 codon. Five microliters (5 u.1) of the resulting blunted-ended PCR product were 
ligated into 25 ng of linearized pCR®-Blunt vector (Invitrogen, Carlsbad, CA) 
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interrupting the lethal ccdB gene of the vector. The resulting ligated vector was 
transformed into TOP 10 E\ coh (Invitrogen, Carlsbad, CA) using a One Shot™ 
Transformation Kit (Invitrogen, Carlsbad, CA) following the manufacturer's 
directions. The transformed cells were grown on LB-Kan (50 u.g/ml kanamycin) 
5 selection plates at 37°C. Only a cell containing a plasmid with an interrupted ccdB 
gene will grow after transformation [Grant, S.G.N., PNAS 87:4645-4649 
(1990)]. Transformed colonies were picked and grown up in 3 ml of LB-Kan 
broth at 37°C. Plasmid DNA was isolated by using a QIAprep® (Qiagen Inc., 
Santa Clarita, CA) procedure, as directed by the manufacturer. The DNA was 
10 digested with EcoRI and Noil restriction enzymes to release the BS124 insert 
fragment. The fragment was electrophoresed on 1% Seakem® LE agarose/0.5 
jlg/ml ethidium bromide/TE gel, visualized by UV irradiation, excised and purified 
using QIAquick™ (Qiagen Inc., Santa Clarita, CA) procedures, as directed by the 
manufacturer. 

15 The pcDNA3. 1/Myc-His plasmid DNA was linearized by digestion with 

EcoRI and NotI in the polylinker region of the plasmid DNA. The purified BS 1 24 
fragment was ligated with the resulting plasmid DNA backbone, with the BS124 
sequence downstream of a CMV promoter which directs expression in mammalian 
cells. The ligated plasmid was transformed into DH5oc™ cells (GibcoBRL Grand 

20 Island, NY), as directed by the manufacturer. Briefly, 10 ng of pcDN A3. 1/Myc- 
His containing a BS 124 insert were added to 50 |ll of competent DH5 alpha cells, 
and the contents were mixed gently. The mixture was incubated on ice for 30 
min, heat shocked for 20 sec at 37°C, and placed on ice for an additional 2 min. 
Upon addition of 0.95 ml of LB medium, the mixture was incubated for 1 hr at 

25 37°C while shaking at 225 rpm. The transformed cells then were plated onto 100 
mm LB/Amp (50|Jg/ml ampicillin) plates and grown at 37°C. Colonies were 
picked and grown in 3 ml of LB/Amp broth. Plasmid DNA was purified using a 
QIAprep kit. The presence of the insert was confirmed using techniques known to 
those skilled in the art, including, but not limited to restriction digestion and gel 

30 analysis. (J. Sambrook et al., supra .) 

B. Transfection of Human Embryonic Kidney Cell 293 Cells . The 
BS124 expression plasmid described in section A, supra , is retransformed into 
DH5 alpha cells, plated onto LB/ampicillin agar, and grown up in 10 ml of 
LB/ampicillin broth, as described hereinabove. The plasmid is purified using a 

35 QIAfilter™ Maxi kit (Qiagen, Chatsworth, CA) and is transfected into HEK293 

cells (F.L. Graham et al., J. Gen. Vir. 36:59-72 d 9771 V These cells are available 
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from the A.T.C.C., 12301 Parklawn Drive, Rockville, MD 20852, under 
Accession No. CRL 1573. Transfection is carried out using the cationic 
lipofectamine-mediated procedure described by P. Hawley-Nelson et al., Focus 
15.73 (1993). Particularly, HEK293 cells are cultured in 10 ml DMEM media 
5 supplemented with 10% fetal bovine serum (FBS), L-glutamine (2 mM) and 
freshly seeded into 100 mm culture plates at a density of 9 x 10 s cells per plate. 
The cells are grown at 37 °C to a confluency of between 70% and 80% for 
transfection. Eight micrograms (8 |ig) of plasmid DNA are added to 800 |il of 
Opti-MEM I® medium (Gibco-BRL, Grand Island, NY), and 48-96 pi of 

10 Lipofectamine™ Reagent (Gibco-BRL, Grand Island, NY) are added to a second 
800 |ll portion of Opti-MEM I media. The two solutions are mixed and incubated 
at room temperature for 15-30 min. After the culture medium is removed from the 
cells, the cells are washed once with 10 ml of serum-free DMEM. The Opti-MEM 
I-Lipofectamine-plasmid DNA solution is diluted with 6.4 ml of serum-free 

1 5 DMEM and then overlaid onto the cells. The cells are incubated for 5 hr at 37°C, 
after which time, an additional 8 ml of DMEM with 20% FBS are added. After 
18-24 h, the old medium is aspirated, and the cells are overlaid with 5 ml of fresh 
DMEM with 5% FBS. Supernatants and cell extracts are analyzed for BS124 
gene activity 72 hr after transfection. 

20 C. Analysis of Breast Tissue Gene BS 124 Antigen Expression . The 

culture supernatant, supra , is transferred to cryotubes and stored on ice. HEK293 
cells are harvested by washing twice with 10 ml of cold Dulbecco's PBS and 
lysing by addition of 1.5 ml of CAT lysis buffer (Boehringer Mannheim, 
Indianapolis, IN), followed by incubation for 30 min at room temperature. Lysate 

25 is transferred to 1 .7 ml polypropylene microfuge tubes and centrifuged at 1000 x g 
for 10 min. The supernatant is transferred to new cryotubes and stored on ice. 
Aliquots of supernatants from the cells and the lysate of the cells expressing the 
BS 124 protein construct are analyzed for the presence of BS 124 recombinant 
protein. The aliquots can be run on SDS-polyacrylamide gel electrophoresis 

30 (SDS-PAGE) using standard methods and reagents known in the art. (J. 

Sambrook et al., supra ) These gels can then be blotted onto a solid medium such 
as nitrocellulose, nytran, etc., and the BS124 protein band can be visualized using 
western blotting techniques with anti-myc epitope or anti-histidine monoclonal 
antibodies (Invitrogen, Carlsbad, CA) or anti-BS124 polyclonal serum (see 

35 Example 14). Alternatively, the expressed BS 124 recombinant protein can be 
analyzed by mass spectrometry (see Example 12). 
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D. Purification. Purification of the BS 124 recombinant protein containing 
the myc-his sequence is performed using the Xpress® affinity chromatography 
system (Invitrogen, Carlsbad, CA) containing a nickel-charged agarose resin 
which specifically binds polyhistidine residues. Supernatants from 10 x 100 mm 
5 plates, prepared as described supra , are pooled and passed over the nickel-charged 
column. Non-binding protein is eluted by washing the column with 50 mM Tris- 
HC1 (pH 7.5)/150 mM NaCl buffer, leaving only the myc-his fusion proteins. 
Bound BS 124 recombinant protein then is eluted from the column using either an 
excess of imidazole or histidine, or a low pH buffer. Alternatively, the 

10 recombinant protein can also be purified by binding at the myc-his sequence to an 
affinity column consisting of either anti-myc or anti-histidine monoclonal 
antibodies conjugated through a hydrazide or other linkage to an agarose resin and 
eluting with an excess of myc peptide or histidine, respectively. 

The purified recombinant protein can then be covalently cross-linked to a 

15 solid phase, such as N-hydroxysuccinimide-activated sepharose columns 

(Pharmacia Biotech, Piscataway, NJ), as directed by supplier's instructions. 
Columns containing covalently linked BS124 recombinant protein, can then be 
used to purify anti-BS 124 antibodies from rabbit or mouse sera (see Examples 13 
and 14). 

20 E. Coating Microtiter Plates with BS124 Expressed Proteins. Supernatant 

from a 100 mm plate, as described supra , is diluted in an appropriate volume of 
PBS. Then, 100 u.1 of the resulting mixture is placed into each well of a Reacti- 
Bind™ metal chelate microtiter plate (Pierce, Rockford, IL), incubated at room 
temperature while shaking, and followed by three washes with 200 |Lil each of 

25 PBS with 0.05% Tween® 20. The prepared microtiter plate can then be used to 
screen polyclonal antisera for the presence of BS124 antibodies (see Example 17). 

Although pcDNA3.1/Myc-His is utilized in this example, it is known to 
those skilled in the art that other comparable expression systems can be utilized 
herein with appropriate modifications in reagent and/or techniques and are within 

30 the skill of one of ordinary skill in the art. The largest cloned insert containing the 
coding region of the BS 124 gene is sub-cloned into either (i) a eukaryotic 
expression vector which may contain, for example, a cytomegalovirus (CMV) 
promoter and/or protein fusible sequences which aid in protein expression and 
detection, or (ii) a bacterial expression vector containing a superoxide-dismutase 

35 (SOD) and CMP-KDO synthetase (CKS) or other protein fusion gene for 

expression of the protein sequence. Methods and vectors which are useful for the 
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production of polypeptides which contain fusion sequences of SOD are described 
in published EPO application No. EP 0 196 056, published October 1, 1986, 
which is incorporated herein by reference, and vectors containing fusion 
sequences of CKS are described in published EPO application No. EP 0 331 961, 
5 published September 13, 1 989, which publication is also incorporated herein by 
reference. The purified protein can be used in a variety of techniques, including, 
but not limited to animal immunization studies, solid phase immunoassays, etc. 

Example 12: Chemical Analysis of Breast Tissue Proteins 

10 A. Analysis of Tryptic Peptide Fragments Using MS . Sera from patients 

with breast disease, such as breast cancer, sera from patients with no breast 
disease, extracts of breast tissues or cells from patients with breast disease, such 
as breast cancer, extracts of breast tissues or cells from patients with no breast 
disease, and extracts of tissues or cells from other non-diseased or diseased 

15 organs of patients, are run on a polyacrylamide gel using standard procedures and 
stained with Coomassie Blue. Sections of the gel suspected of containing the 
unknown polypeptide are excised and subjected to an in-gel reduction, 
acetamidation and tryptic digestion. P. Jeno et al., Anal. Bio . 224:451-455 
(1995) and J. Rosenfeld et al., Anal. Bio . 203:173-179 (1992). The gel sections 

20 are washed with 100 mM NH 4 HC0 3 and acetonitrile. The shrunken gel pieces are 
swollen in digestion buffer (50 mM NH 4 HC0 3 , 5 mM CaCl 2 and 12.5 \lg/m\ 
trypsin) at 4°C for 45 min. The supernatant is aspirated and replaced with 5 to 10 
jll of digestion buffer without trypsin and allowed to incubate overnight at 37°C. 
Peptides are extracted with 3 changes of 5% formic acid and acetonitrile and 

25 evaporated to dryness. The peptides are adsorbed to approximately 0.1 of 
POROS R2 sorbent (Perseptive Biosystems, Framingham, Massachusetts) 
trapped in the tip of a drawn gas chromatography capillary tube by dissolving 
them in 10 (0.1 of 5% formic acid and passing it through the capillary. The 
adsorbed peptides are washed with water and eluted with 5% formic acid in 60% 

30 methanol. The eluant is passed directly into the spraying capillary of an API m 
mass spectrometer (Perkin-Elmer Sciex, Thornhill, Ontario, Canada) for analysis 
by nano-electrospray mass spectrometry. M. Wilm et al., Int. J. Mass Spectrom. 
Ion Process 136:167-180(1994) and M. Wilm et al.. Anal. Chem . 66:1-8 (1994). 
The masses of the tryptic peptides are determined from the mass spectrum 

35 obtained off the first quadrupole. Masses corresponding to predicted peptides can 
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be further analyzed in MS/MS mode to give the amino acid sequence of the 
peptide. 

B. Peptide Fragment Analysis Using LC/MS . The presence of 
polypeptides predicted from mRNA sequences found in hyperplastic disease 
5 tissues also can be confirmed using liquid chromatography/tandem mass 
spectrometry (LC/MS/MS). D. Hess et al., METHODS. A Companion to 
Methods in Enzvmology 6:227-238 (1994). The serum specimen or tumor extract 
from the patient is denatured with SDS and reduced with dithiothreitol (1.5 mg/ml) 
for 30 min at 90°C followed by alkylation with iodoacetamide (4 mg/ml) for 15 

1 0 min at 25°C. Following acrylamide electrophoresis, the polypeptides are 
electroblotted to a cationic membrane and stained with Coomassie Blue. 
Following staining, the membranes are washed and sections thought to contain the 
unknown polypeptides are cut out and dissected into small pieces. The 
membranes are placed in 500 u.1 microcentrifuge tubes and immersed in 10 to 20 

15 (J.1 of proteolytic digestion buffer (100 mM Tris-HCl, pH 8.2, containing 0.1 M 
NaCl, 10% acetonitrile, 2 mM CaCl 2 and 5 ug/ml trypsin) (Sigma, St. Louis, 
MO). After 15 hr at 37°C, 3 u.1 of saturated urea and 1 ul of 100 ug/ml trypsin are 
added and incubated for an additional 5 hr at 37°C. The digestion mixture is 
acidified with 3 ul of 10% trifluoroacetic acid and centrifuged to separate 

20 supernatant from membrane. The supernatant is injected directly onto a 

microbore, reverse phase HPLC column and eluted with a linear gradient of 
acetonitrile in 0.05% trifluoroacetic acid. The eluate is fed directly into an 
electrospray mass spectrometer, after passing though a stream splitter if necessary 
to adjust the volume of material. The data is analyzed following the procedures set 

25 forth in Example 12, Section A. 

Example 13: Gene Immunization Protocol 
A. In Vivo Antigen Expression . Gene immunization circumvents protein 
purification steps by directly expressing an antigen in vivo after inoculation of the 

30 appropriate expression vector. Also, production of antigen by this method may 
allow correct protein folding and glycosylation since the protein is produced in 
mammalian tissue. The method utilizes insertion of the gene sequence into a 
plasmid which contains a CMV promoter, expansion and purification of the 
plasmid and injection of the plasmid DNA into the muscle tissue of an animal. 

35 Preferred animals include mice and rabbits. See, for example, H. Davis et al., 
Human Molecular Genetics 2: 1 847-1 85 1 (1993). After one or two booster 
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immunizations, the animal can then be bled, ascites fluid collected, or the animal's 
spleen can be harvested for production of hybridomas. 

B. Plasmid Preparation and Purification . BS 1 24 cDNA sequences are 
generated from the BS124 cDNA-containing vector using appropriate PCR 

5 primers containing suitable 5' restriction sites following the procedures described 
in Example 1 1 . The PCR product is cut with appropriate restriction enzymes and 
inserted into a vector which contains the CMV promoter (for example, pRc/CMV 
or pcDNA3 vectors from Invitrogen, San Diego, CA). This plasmid then is 
expanded in the appropriate bacterial strain and purified from the cell lysate using a 
10 CsCl gradient or a Qiagen plasmid DNA purification column. All these techniques 
are familiar to one of ordinary skill in the art of molecular biology. 

C. Immunization Protocol . Anesthetized animals are immunized 
intramuscularly with 0.1-100 jig of the purified plasmid diluted in PBS or other 
DNA uptake enhancers (Cardiotoxin, 25% sucrose). See, for example, H. Davis 

15 et al.. Human Gene Therapy 4:733-740 (1993); and P. W. Wolff et al., 

Biotechniques 1 1 :474-485 (1991). One to two booster injections are given at 
monthly intervals. 

D. Testing and Use of Antiserum . Animals are bled and the resultant sera 
tested for antibody using peptides synthesized from the known gene sequence (see 

20 Example 1 6) using techniques known in the art, such as western blotting or EIA 
techniques. Antisera produced by this method can then be used to detect the 
presence of the antigen in a patient's tissue or cell extract or in a patient's serum by 
ELISA or Western blotting techniques, such as those described in Examples 15 
through 18. 

25 

Example 14: Production of Antibodies Against BS124 
A. Production of Polyclonal Antisera. Antiserum against BS 124 was 
prepared by injecting rabbits with peptides whose sequences were derived from 
that of the predicted amino acid sequence of the BS124 consensus nucleotide 

30 sequence (SEQUENCE ID NO 5). The synthesis of exemplary BS 1 24 peptides 
(e.g., the peptides of SEQUENCE ID NO 23, SEQUENCE ID NO 24, and 
SEQUENCE ID NO 25) is described in Example 10. Peptides used as 
immunogens were not conjugated to a carrier such as keyhole limpet 
hemocyanine, KLH, (i.e., they were unconjugated.). 

35 Animal Immunization. Female white New Zealand rabbits 

weighing 2 kg or more were used for raising polyclonal antiserum. One animal 
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was immunized per unconjugated peptide (SEQUENCE ID NO 23, SEQUENCE 
ED NO 24, or SEQUENCE ID NO 25). One week prior to the first immunization, 
a 5 to 10 ml blood sample was obtained from each animal to serve as a non- 
immune prebleed sample. 
5 Unconjugated BS 1 24 peptides (SEQUENCE ID NO 23, SEQUENCE ED 

NO 24, and SEQUENCE ID NO 25) were used to prepare the primary 
immunogen by emulsifying 0.5 ml of the peptide at a concentration of 2 mg/ml in 
PBS (pH 7.2) which contained 0.5 ml of complete Freund's adjuvant (CFA) 
(Difco, Detroit, MI). The immunogen was injected into several sites of the animal 

10 via subcutaneous, intraperitoneal, and intramuscular routes of administration. 
Four weeks following the primary immunization, a booster immunization was 
administered. The immunogen used for the booster immunization dose was 
prepared by emulsifying 0.5 ml of the same unconjugated peptide used for the 
primary immunogen, except that the peptide now was diluted to 1 mg/ml with 0.5 

15 ml of incomplete Freund's adjuvant (IF A) (Difco, Detroit, MI). Again, the 

booster dose was administered into several sites via subcutaneous, intraperitoneal 
and intramuscular types of injections. The animals were bled (5 ml) two weeks 
after the booster immunizations and each serum was tested for immunoreactivity to 
the peptide as described below. The booster and bleed schedule was repeated at 4 

20 week intervals until an adequate titer was obtained. The titer or concentration of 
antiserum was determined by using unconjugated peptides in a microtiter EIA as 
described in Example 1 7, below. An antibody titer of 1 :500 or greater was 
considered an adequate titer for further use and study. 

25 Table 2. Titer of rabbit anti-BS124 peptide antisera (I 1 week bleeds 

Peptide Immunog en Titer 

SEQUENCE ID NO 23 41 ,000 

SEQUENCE ID NO 24 1 0,000 

SEQUENCE ID NO 25 45 ,000 

B. Production of Monoclonal Antibody. 

1. Immunization Protocol. Mice are immunized using peptides 
30 which can either be conjugated to a carrier such as KLH [prepared as described 
hereinbelow, or unconjugated (i.e., not conjugated to a carrier such as KLH)] 
except that the amount of the unconjugated or conjugated peptide for monoclonal 
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antibody production in mice is one-tenth the amount used to produce polyclonal 
antisera in rabbits. Thus, the primary immunogen consists of 100 fig of 
unconjugated or conjugated peptide in 0. 1 ml of CFA emulsion while the 
immunogen used for booster immunizations consists of 50 jig of unconjugated or 
5 conjugated peptide in 0. 1 ml of IFA. Hybridomas for the generation of 

monoclonal antibodies are prepared and screened using standard techniques. The 
methods used for monoclonal antibody development follow procedures known in 
the art such as those detailed in Kohler and Milstein, Nature 256:494 (1975) and 
reviewed in J.G.R. Hurrel, ed., Monoclonal Hybridoma Antibodies: Techniques 

10 and Applications . CRC Press. Inc.. Boca Raton. FL (1982). Another method of 
monoclonal antibody development which is based on the Kohler and Milstein 
method is that of L.T. Mimms et al., Virology 176:604-619 (1990), which is 
incorporated herein by reference. 

The immunization regimen (per mouse) consists of a primary 

15 immunization with additional booster immunizations. The primary immunogen 
used for the primary immunization consists of 100 fig of unconjugated or 
conjugated peptide in 50 u.1 of PBS (pH 7.2) previously emulsified in 50 jal of 
CFA. Booster immunizations performed at approximately two weeks and four 
weeks post primary immunization consist of 50 fig of unconjugated or conjugated 

20 peptide in 50 |il of PBS (pH 7.2) emulsified with 50 |il IFA. A total of 1 00 ill of 
this immunogen are inoculated intraperitoneally and subcutaneously into each 
mouse. Individual mice are screened for immune response by microtiter plate 
enzyme immunoassay (EIA) as described in Example 17 approximately four 
weeks after the third immunization. Mice are inoculated either intravenously, 

25 intrasplenically or intraperitoneally with 50 ug of unconjugated or conjugated 
peptide in PBS (pH 7.2) approximately fifteen weeks after the third 
immunization.. 

Three days after this intravenous boost, splenocytes are fused with, for 
example, Sp2/0-Agl4 myeloma cells (Milstein Laboratories, England) using the 

30 polyethylene glycol (PEG) method. The fusions are cultured in Iscove's Modified 
Dulbecco's Medium (IMDM) containing 10% fetal calf serum (FCS), plus 1% 
hypoxan thine, aminopterin and thymidine (HAT). Bulk cultures were screened by 
microtiter plate EIA following the protocol in Example 17. Clones reactive with 
the peptide used an immunogen and non-reactive with other peptides (i.e., BS124 

35 peptides not used as the immunogen) are selected for final expansion. Clones thus 
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selected are expanded, aliquoted and frozen in IMDM containing 10% FCS and 
10% dimethyl sulfoxide, (DMSO). 

2. Peptide Conjugation. Peptide is conjugated to maleimide 
activated KLH (commercially available as Imject®, available from Pierce Chemical 
Company, Rockford, IL). Imject® contains about 250 moles of reactive maleimide 
groups per mole of hemocyanine. The activated KLH is dissolved in phosphate 
buffered saline (PBS, pH 8.4) at a concentration of about 7.7 mg/ml. The peptide 
is conjugated through cysteines occurring in the peptide sequence, or to a cysteine 
previously added to the synthesized peptide in order to provide a point of 
attachment. The peptide is dissolved in DMSO (Sigma Chemical Company, St. 
Louis, MO) and reacted with the activated KLH at a mole ratio of about 1.5 moles 
of peptide per mole of reactive maleimide attached to the KLH. A procedure for 
the conjugation of peptide is provided hereinbelow. It is known to the ordinary 
artisan that the amounts, times and conditions of such a procedure can be varied to 
optimize peptide conjugation. 

The conjugation reaction described hereinbelow is based on obtaining 3 
mg of KLH peptide conjugate ("conjugated peptide"), which contains about 0.77 
pinoles of reactive maleimide groups. This quantity of peptide conjugate usually 
is adequate for one primary injection and four booster injections for production of 
polyclonal antisera in a rabbit. Briefly, peptide is dissolved in DMSO at a 
concentration of 1.16 |Umoles/100 pi of DMSO. One hundred microliters (100 jllI) 
of the DMSO solution are added to 380 pi of the activated KLH solution prepared 
as described hereinabove, and 20 pi of PBS (pH 8.4) are added to bring the 
volume to 500 pi The reaction is incubated overnight at room temperature with 
stirring. The extent of reaction is determined by measuring the amount of 
unreacted thiol in the reaction mixture. The difference between the starting 
concentration of thiol and the final concentration is assumed to be the 
concentration of peptide which has coupled to the activated KLH. The amount of 
remaining thiol is measured using Ellman's reagent (5,5'-dithiobis(2-nitrobenzoic 
acid), Pierce Chemical Company, Rockford, IL). Cysteine standards are made at 
concentrations of 0, 0.1, 0.5, 2, 5, and 20 mM by dissolving 35 mg of cysteine 
HC1 (Pierce Chemical Company, Rockford, DL) in 10 ml of PBS (pH 7.2) and 
diluting the stock solution to the desired concentrations. The photometric 
determination of the concentration of thiol is accomplished by placing 200 pi of 
PBS (pH 8.4) in each well of an Immulon 2® microwell plate (Dynex 
Technologies, Chantilly, VA). Next, 10 pi of standard or reaction mixture are 
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added to each well. Finally, 20 [L\ of Ellman's reagent at a concentration of 1 
mg/ml in PBS (pH 8.4) are added to each well. The wells are incubated for 10 
minutes at room temperature, and the absorbance of all wells is read at 415 nm 
with a microplate reader (such as the BioRad Model 3550, BioRad, Richmond, 
5 CA). The absorbance of the standards is used to construct a standard curve and 
the thiol concentration of the reaction mixture is determined from the standard 
curve. A decrease in the concentration of free thiol is indicative of a successful 
conjugation reaction. Unreacted peptide is removed by dialysis against PBS (pH 
7.2) at room temperature for 6 hours. The conjugate is stored at 2-8°C if it is to be 
10 used immediately; otherwise, it is stored at -20°C or colder. 

3. Production of Ascites Fluid Containing Monoclonal Antibodies. 
Frozen hybridoma cells prepared as described hereinabove are thawed and placed 
into expansion culture. Viable hybridoma cells are inoculated intraperitoneally into 
Pristane treated mice. Ascitic fluid is removed from the mice, pooled, filtered 

15 through a 0.2 |l filter and subjected to an immunoglobulin class G (IgG) analysis 
to determine the volume of the Protein A column required for the purification. 

4. Purification of Monoclonal Antibodies From Ascites Fluid. 
Briefly, filtered and thawed ascites fluid is mixed with an equal volume of Protein 
A sepharose binding buffer (1.5 M glycine, 3.0 M NaCl, pH 8.9) and refiltered 

20 through a 0.2 |l filter. The volume of the Protein A column is determined by the 
quantity of IgG present in the ascites fluid. The eluate then is dialyzed against 
PBS (pH 7.2) overnight at 2-8°C. The dialyzed monoclonal antibody is sterile 
filtered and dispensed in aliquots. The immunoreactivity of the purified 
monoclonal antibody is confirmed by determining its ability to specifically bind to 

25 the peptide used as the immunogen by use of the EIA microtiter plate assay 

procedure of Example 17. The specificity of the purified monoclonal antibody is 
confirmed by determining its lack of binding to irrelevant peptides such as 
peptides of BS 124 not used as the immunogen. The purified anti-BS 124 
monoclonal thus prepared and characterized is placed at either 2-8°C for short term 

30 storage or at -80°C for long term storage. 

5. Further Characterization of Monoclonal Antibody. The isotype 
and subtype of the monoclonal antibody produced as described hereinabove can be 
determined using commercially available kits (available from Amersham. Inc., 
Arlington Heights, EL). Stability testing also can be performed on the monoclonal 

35 antibody by placing an aliquot of the monoclonal antibody in continuous storage at 
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2-8°C and assaying optical density (OD) readings throughout the course of a given 
period of time. 

C. Use of Recombinant Proteins as Immunogens. It is within the scope 
of the present invention that recombinant proteins made as described herein can be 
5 utilized as immunogens in the production of polyclonal and monoclonal 

antibodies, with corresponding changes in reagents and techniques known to 
those skilled in the art. 

Example 15: Purification of Serum Antibodies Which Specifically 

10 Bind to BS 124 Peptides 

Immune sera, obtained as described hereinabove in Examples 13 and/or 
14, is affinity purified using immobilized synthetic peptides prepared as described 
in Example 10, or recombinant proteins prepared as described in Example 1 1 . An 
IgG fraction of the antiserum is obtained by passing the diluted, crude antiserum 

15 over a Protein A column (Affi-Gel protein A, Bio-Rad, Hercules, CA). Elution 
with a buffer (Binding Buffer, supplied by the manufacturer) removes 
substantially all proteins that are not immunoglobulins. Elution with 0. 1 M 
buffered glycine (pH 3) gives an immunoglobulin preparation that is substantially 
free of albumin and other serum proteins. 

20 Immunoaffinity chromatography is performed to obtain a preparation with 

a higher fraction of specific antigen-binding antibody. The peptide used to raise 
the antiserum is immobilized on a chromatography resin, and the specific 
antibodies directed against its epitopes are adsorbed to the resin. After washing 
away non-binding components, the specific antibodies are eluted with 0. 1 M 

25 glycine buffer, pH 2.3. Antibody fractions are immediately neutralized with 1.0 
M Tris buffer (pH 8.0) to preserve immunoreactivity. The chromatography resin 
chosen depends on the reactive groups present in the peptide. If the peptide has an 
amino group, a resin such as Affi-Gel 10 or Affi-Gel 15 is used (Bio-Rad, 
Hercules, CA). If coupling through a carboxy group on the peptide is desired, 

30 Affi-Gel 102 can be used (Bio-Rad, Hercules, CA). If the peptide has a free 
sulfhydryl group, an organomercurial resin such as Affi-Gel 501 can be used 
(Bio-Rad, Hercules, CA). 

Alternatively, spleens can be harvested and used in the production of 
hybridomas to produce monoclonal antibodies following routine methods known 

35 in the art as described hereinabove. 



Atty Dkt No. 6120.US.P1 
PATENT 



-87- 

Example 16: Western Blotting of Tissue Samples 
Protein extracts were prepared by homogenizing tissue samples in 0. 1 M 
Tris-HCl (pH 7.5), 15% (w/v) glycerol, 0.2 mM EDTA, 1.0 mM 1,4- 
dithiothreitol, 10 |ig/ml leupeptin and 1.0 mM phenylmethylsulfonylfluoride (S. 
5 R. Kain et al.. Biotechniques 17:982 (1994). Following homogenization, the 
homogenates were centrifuged at 4°C for 5 minutes to separate supernatant from 
debris. For protein quantitation, 3- 1 0 jal of supernatant were added to 1.5 ml of 
bicinchoninic acid reagent (Sigma, St. Louis, MO), and the resulting absorbance 
at 562 nm was measured. 

10 For SDS-PAGE, samples were adjusted to desired protein concentration 

with Tricine Buffer (Novex, San Diego, CA), mixed with an equal volume of 2X 
Tricine sample buffer (Novex, San Diego, CA), and heated for 5 minutes at 100°C 
in a thermal cycler. Samples were then applied to a Novex 10-20% Precast Tricine 
Gel for electrophoresis. Following electrophoresis, samples were transferred from 

1 5 the gels to nitrocellulose membranes in Novex Tris-Glycine Transfer buffer. 
Membranes were then probed with specific anti-peptide antibodies using the 
reagents and procedures provided in the Western Lights Plus or Western Lights 
(Tropix, Bedford, MA) chemiluminescence detection kits. Chemiluminescent 
bands were visualized by exposing the developed membranes to Hyperfilm ECL 

20 (Amersham, Arlington Heights, IL). 

After visualization of the bands on film, the bands were also visualized 
directly on the membranes by the addition and development of chromogenic 
substrate 5-bromo-4-chloro-3-indolyl phosphate (BCIP). This chromogenic 
solution contains 0.016% BCIP in a solution containing 100 mM NaCl, 5 mM 

25 MgCl 2 and 100 mM Tris-HCl, pH 9.5. The filter was incubated in the solution at 
room temperature until the bands developed to the desired intensity (1-2 days). 
The image was captured by transillumination with a light box and recording the 
image with an Alpha Imager 2000 (Alpha Innotech Corporation, San Leandro, 
CA). Molecular mass determination was made based upon the mobility of pre- 

30 stained molecular weight standards (Novex, San Diego, CA) and biotinylated 
molecular weight standards (Tropix, Bedford, MA). 

Figure 6A shows the results of the Western blot performed on a panel of 
tissue extracts using antiserum against the BS124 synthetic peptide of 
SEQUENCE ID NO 23 (see Example 14). Each lane of Figure 6A represents a 

35 different tissue protein extract: (1) normal kidney; (2) normal bladder; (3) colon 
cancer; (4) normal lung; (5-13) breast cancer; and (14) molecular weight markers 
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(kD). A band near 21 kD was detected in 7 of 9 breast cancers tissue extracts and 
in the colon cancer tissue extract (lane 3). Figure 6B shows similar data. Each 
lane of Figure 6B represents a different tissue protein extract: (1) BPH; (2) 
prostate cancer; (3) lung cancer; (4) normal colon; (5) colon cancer; (6) bladder; 
5 (7) bladder cancer; (8) normal bladder; (9) bladder cancer; (10-12) normal breast; 
(13) breast cancer; and (14) molecular weight markers (kD). A band near 21 kD 
was observed in the breast cancer tissue extract (lane 13) but not in normal breast 
tissue extract (lanes 10-12). Faint bands near 21 kD were observed in the normal 
bladder tissue extract (lanes 6 and 8), while more pronounced 21 kD bands were 

10 observed in bladder cancer tissue extract (lanes 7 and 9). Varying intensities of 
the 21 kD band were also observed in BPH tissue extract (lane 1), prostate cancer 
tissue extract (lane 2), lung cancer tissue extract (lane 3), normal colon tissue 
extract (lanes 6 and 8) and colon cancer tissue extract (lanes 7 and 9). 

Competition experiments were carried out in an analogous manner as 

15 above with the following exception: primary antibodies (anti-BS 124 peptide 
polyclonal antisera) were pre-incubated overnight at 4°C with varying 
concentrations of peptide immunogen prior to exposure to the nitrocellulose filter. 
Development of the Western was continued as above. Antibody binding to the 
band at 21 kD was inhibited at a concentration of 2.6 u\M of the BS 124 peptide 

20 (SEQUENCE ID NO 23) in a breast cancer sample. 

Example 17: EIA Microtiter Plate Assay 
The immunoreactivity of antiserum preferably obtained from rabbits as 
described in Example 14 was determined by means of a microtiter plate EIA, as 

25 follows. Briefly, the synthetic BS 124 peptides of SEQUENCE ID NO 23, 

SEQUENCE ED NO 24, and SEQUENCE ID NO 25, prepared as described in 
Example 10, were dissolved in carbonate buffer (50 mM, pH 9.6) to a final 
concentration of 2 Ug/ml. Next, 100 ul of the peptide or protein solution were 
placed in each well of an Immulon 2® microtiter plate (Dynex Technologies, 

30 Chantilly, VA). The plate was incubated overnight at room temperature and then 
washed four times with deionized water. The wells were blocked by adding 125 
Ul of a suitable protein blocking agent, such as Superblock® (Pierce Chemical 
Company, Rockford, IL), to each well and then immediately discarding the 
solution. This blocking procedure was performed three times. Antiserum 

35 obtained from immunized rabbits or mice, prepared as previously described, was 
diluted in a protein blocking agent (e.g., a 3% Superblock® solution) in PBS 
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containing 0.05% Tween-20® (monolaurate polyoxyethylene ether) (Sigma 
Chemical Company, St. Louis, MO) and 0.05% sodium azide at dilutions of 
1 : 100, 1 :500, 1 :2500, 1 : 12,500, and 1 :62,500 and placed in each well of the 
coated microtiter plate. The wells then were incubated for three hours at room 
5 temperature. Each well was washed four times with deionized water. One 
hundred microliters of alkaline phosphatase-conjugated goat anti-rabbit IgG or 
goat anti-mouse IgG antiserum (Southern Biotech, Birmingham, AB) diluted 
1:2000 in 3% Superblock® solution in phosphate buffered saline containing 0.05% 
Tween 20® and 0.05% sodium azide, were added to each well. The wells were 

10 incubated for two hours at room temperature. Next, each well was washed four 
times with deionized water. One hundred microliters of paranitrophenyl 
phosphate substrate (Kirkegaard and Perry Laboratories, Gaithersburg, MD) then 
were added to each well. The wells were incubated for thirty minutes at room 
temperature. The absorbance at 405 nm was read in each well. Positive reactions 

1 5 were identified by an increase in absorbance at 405 nm in the test well above that 
absorbance given by a non-immune serum (negative control). A positive reaction 
was indicative of the presence of detectable anti-BS124 antibodies. Titers of the 
anti-peptide antisera were calculated from the previously described dilutions of 
antisera and defined as the calculated dilution, where A 405nm =0.5 OD. 

20 

Example 18: Coating of Solid Phase Particles 
A. Coating of Microparticles with Antibodies Which Specifically Bind to 
BS124 Antigen . Affinity purified antibodies which specifically bind to BS124 
protein (see Example 1 5) are coated onto microparticles of polystyrene, 

25 carboxylated polystyrene, polymethylacrylate, or similar particles having a radius 
in the range of about 0.1 to 20 urn. Microparticles may be either passively or 
actively coated. One coating method comprises coating ED AC (l-(3- 
dimethylaminopropyl)-3-ethylcarbodiimide hydrochloride (Aldrich Chemical Co., 
Milwaukee, WI) activated carboxylated latex microparticles with antibodies which 

30 specifically bind to BS124 protein, as follows. Briefly, a final 0.375% solid 
suspension of resin washed carboxylated latex microparticles (available from 
Bangs Laboratories, Carmel, IN or Serodyn, Indianapolis, IN) is mixed in a 
solution containing 50 mM MES buffer, pH 4.0 and 1 50 mg/1 of affinity purified 
anti-BS124 antibody (see Example 14) for 15 min in an appropriate container. 

35 EDAC coupling agent is added to a final concentration of 5.5 Jig/ml to the mixture 
and mixed for 2.5 hr at room temperature. 
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The microparticles then are washed with 8 volumes of a Tween 
20®/sodium phosphate wash buffer (pH 7.2) by tangential flow filtration using a 
0.2 lim Microgon Filtration module. Washed microparticles are stored in an 
appropriate buffer which usually contains a dilute surfactant and irrelevant protein 
5 as a blocking agent, until needed. 

B. Coating of 1/4 Inch Beads . Antibodies which specifically bind to 
BS124-antigen also may be coated on the surface of 1/4 inch polystyrene beads by 
routine methods known in the art (Snitman et al., US Patent 5,273,882, 
incorporated herein by reference) and used in competitive binding or EIA 
10 sandwich assays. 

Polystyrene beads first are cleaned by ultrasonicating them for about 15 
seconds in 10 mM NaHC0 3 buffer at pH 8.0. The beads then are washed in 
deionized water until all fines are removed. Beads then are immersed in an 
antibody solution in 10 mM carbonate buffer, pH 8 to 9.5. The antibody solution 
1 5 can be as dilute as 1 plg/ml in the case of high affinity monoclonal antibodies or as 
concentrated as about 500 Ug/ml for polyclonal antibodies which have not been 
affinity purified. Beads are coated for at least 12 hours at room temperature, and 
then they are washed with deionized water. Beads may be air dried or stored wet 
(in PBS, pH 7.4). They also may be overcoated with protein stabilizers (such as 
20 sucrose) or protein blocking agents used as non-specific binding blockers (such as 
irrelevant proteins, Carnation skim milk, Superblock®, or the like). 

Example 1 9: Microparticle Enzyme Immunoassay (MEIA) 
BS 124 antigens are detected in patient test samples by performing a 
25 standard antigen competition EIA or antibody sandwich EIA and utilizing a solid 
phase such as microparticles (MEIA). The assay can be performed on an 
automated analyzer such as the IMx® Analyzer (Abbott Laboratories, Abbott Park, 
IL). 

A. Antibody Sandwich EIA. Briefly, samples suspected of containing 
30 BS 124 antigen are incubated in the presence of anti-BS 124 antibody -coated 
microparticles (prepared as described in Example 17) in order to form 
antigen/antibody complexes. The microparticles then are washed and an indicator 
reagent comprising an antibody conjugated to a signal generating compound (i.e., 
enzymes such as alkaline phosphatase or horseradish peroxide) is added to the 
35 antigen/antibody complexes or the microparticles and incubated. The 

microparticles are washed and the bound antibody/antigen/antibody complexes are 
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detected by adding a substrate (e.g., 4-methyl umbelliferyl phosphate (MUP), or 
OPD/peroxide, respectively), that reacts with the signal generating compound to 
generate a measurable signal. An elevated signal in the test sample, compared to 
the signal generated by a negative control, detects the presence of BS124 antigen. 
5 The presence of BS 1 24 antigen in the test sample is indicative of a diagnosis of a 
breast disease or condition, such as breast cancer. 

B. Competitive Binding Assay. The competitive binding assay uses a 
peptide or protein that generates a measurable signal when the labeled peptide is 
contacted with an anti-peptide antibody coated microparticle. This assay can be 

10 performed on the IMx® Analyzer (available from Abbott Laboratories, Abbott 
Park, IL). The labeled peptide is added to the BS124 antibody-coated 
microparticles (prepared as described in Example 1 7) in the presence of a test 
sample suspected of containing BS 124 antigen, and incubated for a time and under 
conditions sufficient to form labeled BS 124 peptide (or labeled protein) / bound 

1 5 antibody complexes and/or patient BS 1 24 antigen / bound antibody complexes. 

The BS124 antigen in the test sample competes with the labeled BS124 peptide (or 
BS 124 protein) for binding sites on the microparticle. BS 124 antigen in the test 
sample results in a lowered binding of labeled peptide and antibody coated 
microparticles in the assay since antigen in the test sample and the BS124 peptide 

20 or BS 1 24 protein compete for antibody binding sites. A lowered signal 

(compared to a control) indicates the presence of BS124 antigen in the test sample. 
The presence of BS124 antigen suggests the diagnosis of abreast disease or 
condition, such as breast cancer. 

The BS124 polynucleotides and the proteins encoded thereby which are 

25 provided and discussed hereinabove are useful as markers of breast tissue disease, 
especially breast cancer. Tests based upon the appearance of this marker in a test 
sample such as blood, plasma or serum can provide low cost, non-invasive, 
diagnostic information to aid the physician to make a diagnosis of cancer, to help 
select a therapy protocol, or to monitor the success of a chosen therapy. This 

30 marker may appear in readily accessible body fluids such as blood, urine or stool 
as antigens derived from the diseased tissue which are detectable by 
immunological methods. This marker may be elevated in a disease state, altered in 
a disease state, or be a normal protein of the breast which appears in an 
inappropriate body compartment. 



35 
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Example 20: Immunohistochemical Detection of BS124 Protein 
Antiserum against the BS 1 24 peptide of SEQUENCE ID NO 23 
(described in Example 14, above) was used to immunohistochemically stain a 
variety of normal and diseased tissues using standard procedures. Briefly, frozen 
5 blocks of tissue were cut into 6 micron sections, and placed on microscope slides. 
After fixation in cold acetone, the sections were dried at room temperature, then 
washed with phosphate buffered saline and blocked. The slides were incubated 
with the BS 124 peptide of SEQUENCE ID NO 23 at a dilution of 1 to 75, 
washed, incubated with biotinylated goat anti-rabbit antibody, washed again, and 
10 incubated with avidin labeled with horse radish peroxidase. After a final wash, 
the slides were incubated with 3-amino-9-ethylcarbazole substrate which gives a 
red stain. The slides were counterstained with hematoxylin, mounted, and 
examined under a microscope by a pathologist. 

Breast tissues stained with anti-BS124 antibody showed intense 
1 5 cytoplasmic immunoreactivity in the epithelium of all benign and malignant breast 
specimens. Weak cytoplasmic immunoreactivity was observed in some urothelial 
cells, colon, and bronchial epithelial cells. 

Results of all tissues examined are tabulated in Table 3. The column 
labeled "# pos" refers to the number of specimens positive out of the total number 
20 of specimens. The column labeled "%" refers to the percent of cells which are 
positive averaged over all tissues examined. 
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Table 3. Tissue Staining by Anti-BS124 Antisera 



TISSUE 


# POS 


% 


COMMENTS 


Colon, normal 


11 A 


IU 


Weak immunoreactivity only on 
superficial cells 


Colon, cancer 


2/3 


A f\ 

41) 


Weak to moderate immunoreactivity 


Lung, normal 


0/4 


0 




Lung, bronchial 
epithelium 


2/4 


90 


Weak immunoreactivity on bronchial 
epithelium 


Liver, normal 
hepatocytes 


0/2 


0 


Intracytoplasmic background i 


Liver, normal 
bile duct 
epithelium 


0/2 


0 




Bladder, normal 
epithelium 


2/3 


10 


Weak, cytoplasmic immunoreactivity on 
the superficial cells 


Bladder, normal 
wall 


0/5 


0 


Smooth muscle cells negative 


Bladder cancer 


1/1 


90 


Intense cytoplasmic immunoreactivity 


Breast, benign 
tissue 


12/12 


78 


Moderate immunoreactivity 


Breast, 

fibrocystic 

changes 


1/1 


40 


Weak epithelial immunoreactivity 


Breast 
intraductal 

carcinoma 


6/6 


70 




Breast invasive 
ductal carcinoma 


10/10 


87 


Intense cytoplasmic immunoreactivity 


Breast, phylloid 
tumor 


1/1 


40 


Moderate epithelial immunoreactivity 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GACGCCCAGT GACCTGCCGA GGTCGGCAGC ACAGAGCTCT GGAGATGAAG ACCCTGTTCC 60 

TGGGTGTCAC GCTCGGCCTG GCCGCTGCCC TGTCCTTCAC CCTGGAGGAG GAGGATATCA 12 0 

CAGGGACCTG GTACGTGAAG GCCATGGTGG TCGATAAGGA CTTTCCGGAG GACAGGAGGC 180 

CCAGGAAGGT GTCCCCAGTG AAGGTGACAG CCCTGGGCGG TGGGAAGTTG GAAGCC 2 36 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

GGATATCACA GGGACCTGGT ACGTGAAGGC CATGGTGGTC GATAAGGACT TTCCGGAGGA 60 

CAGGAGGCCC AGGAAGGTGT CCCCAGTGAA GGTGACAGCC CTGGGCGGTG GGAAGTTGGA 12 0 

AGCCACGTTC ACCTTCATGA GGGAGGATCG GTGCATCCAG AAGAAAATCC TGATGCGGAA 180 

GACGGAGGAG CCTGGCAAAT ACAGCGCCTA TGGGGGCAGG AAGCTCATGT ACCTGCAGGA 24 0 

GCTGC 245 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GGGGGAGAAG GACTTTATTT GGAGTCAGGT GGGTGGGAGC AGGGAAGGGT CATGGCTGGA 60 

GGGTAGGTCC AGGTGGTCCG GGCTCTGTGT CTGGTGGTAG GGTGGGCTCT GGAGGTGCAG 120 

ACCCGGGGGC TGCCTAGTGT TCGGGAACGC AGCTTCCCGT CTGCAGGGGC GTGAAAATGT 180 

CCTCCTCCGA GAGTCCCTTG CGCTGCACCA ATTTCTTAAA TTCTTCCAGG GCCTCCCGGT 240 

TGGTATCAGA ATTCCTACCC ACAAGCTTTC CCATGTGGAG CAGGCCCCCA TGGTGCTGGT 3 00 

CTTTGCAGTA AAAGATGTAG TGGTCCCTCC TGGGCAG 337 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

GACGCCCAGT GACCTGCCGA GGTCGGCAGC ACAGAGCTCT GGAGATGAAG ACCCTGTTCC 60 

TGGGTGTCAC GCTCGGCCTG GCCGCTGCCC TGTCCTTCAC CCTGGAGGAG GAGGATATCA 12 0 

CAGGGACCTG GTACGTGAAG GCCATGGTGG TCGATAAGGA CTTTCCGGAG GACAGGAGGC 180 

CCAGGAAGGT GTCCCCAGTG AAGGTGACAG CCCTGGGCGG TGGGAAGTTG GAAGCCACGT 240 
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TCACCTTCAT GAGGGAGGAT CGGTGCATCC AGAAGAAAAT CCTGATGCGG AAGACGGAGG 3 00 

AGCCTGGCAA ATACAGCGCC TATGGGGGCA GGAAGCTCAT GTACCTGCAG GAGCTGCCCA 3 60 

GGAGGGACCA CTACATCTTT TACTGCAAAG ACCAGCACCA TGGGGGCCTG CTCCACATGG 42 0 

GAAAGCTTGT GGGTAGGAAT TCTGATACCA ACCGGGAGGC CCTGGAAGAA TTTAAGAAAT 4 80 

TGGTGCAGCG CAAGGGACTC TCGGAGGAGG ACATTTTCAC GCCCCTGCAG ACGGGAAGCT 540 

GCGTTCCCGA ACACTAGGCA GCCCCCGGGT CTGCACCTCC AGAGCCCACC CTACCACCAG 600 

ACACAGAGCC CGGACCACCT GGACCTACCC TCCAGCCATG ACCCTTCCCT GCTCCCACCC 660 

ACCTGACTCC AAATAAAGTC CTTCTCCCCC CA 692 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GACGCCCAGT GACCTGCCGA GGTCGGCAGC ACAGAGCTCT GGAGATGAAG ACCCTGTTCC 60 

TGGGTGTCAC GCTCGGCCTG GCCGCTGCCC TGTCCTTCAC CCTGGAGGAG GAGGATATCA 12 0 

CAGGGACCTG GTACGTGAAG GCCATGGTGG TCGATAAGGA CTTTCCGGAG GACAGGAGGC 180 

CCAGGAAGGT GTCCCCAGTG AAGGTGACAG CCCTGGGCGG TGGGAAGTTG GAAGCCACGT 24 0 

TCACCTTCAT GAGGGAGGAT CGGTGCATCC AGAAGAAAAT CCTGATGCGG AAGACGGAGG 300 

AGCCTGGCAA ATACAGCGCC TATGGGGGCA GGAAGCTCAT GTACCTGCAG GAGCTGCCCA 360 

GGAGGGACCA CTACATCTTT TACTGCAAAG ACCAGCACCA TGGGGGCCTG CTCCACATGG 42 0 

GAAAGCTTGT GGGTAGGAAT TCTGATACCA ACCGGGAGGC CCTGGAAGAA TTTAAGAAAT 480 

TGGTGCAGCG CAAGGGACTC TCGGAGGAGG ACATTTTCAC GCCCCTGCAG ACGGGAAGCT 540 

GCGTTCCCGA ACACTAGGCA GCCCCCGGGT CTGCACCTCC AGAGCCCACC CTACCACCAG 600 

ACACAGAGCC CGGACCACCT GGACCTACCC TCCAGCCATG ACCCTTCCCT GCTCCCACCC 660 

ACCTGACTCC AAATAAAGTC CTTCTCCCCC CA 692 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

AGCTCGGAAT TCCGAGCTTG GATCCTCTAG AGCGGCCGCC GACTAGTGAG CTCGTCGACC 60 
CGGGAATT 68 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

AATTAATTCC CGGGTCGACG AGCTCACTAG TCGGCGGCCG CTCTAGAGGA TCCAAGCTCG 6 0 

GAATTCCG 68 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
AGCGGATAAC AATTTCACAC AGGA 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TGTAAAACGA CGGCCAGT 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
ATCCTGATGC GGAAGACG 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
TATCACAGGG ACCTGGTACG 

(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
GCTGTATTTG CCAGGCTC 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
AAAATGTCCT CCTCCGAGAG 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
AGTTGGAAGC CACGTTCAC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
CCCATAGGCG CTGTATTTG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
AGTTGGAAGC CACGTTCAC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
TGGTGCTGGT CTTTGCAGTA AAAG 24 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAAATACAGC GCCTA 15 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
CCCAGTCACG ACGTTGTAAA ACG 2 3 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCGGCCGCCG TGTTCGGGAA CGCAGCTT 2 8 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 0 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu 

1 5 " 10 15 

Ser Phe Thr Leu Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys 

2 0 25 * 3 0 

Ala Met Val Val Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys 

35 40 45 

Val Ser Pro Val Lys Val Thr Ala Leu Gly Gly Gly Lys Leu Glu Ala 

50 ' 55 '* 60 
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(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Leu Glu Glu Glu Asp He Thr Gly Thr Trp Tyr Val Lys Ala Met Val 

1 5 10 15 

Val Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys Val Ser 
2 0 2 5 30 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Pro Val Lys Val Thr Ala Leu Gly Gly Gly Lys Leu Glu Ala Thr Phe 

15 10 15 

Thr Phe Met Arg Glu Asp Arg Cys He Gin Lys Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

He Leu Met Arg Lys Thr Glu Glu Pro Gly Lys Tyr Ser Ala Tyr Gly 

1 5 10 15 

Gly Arg Lys Leu Met Tyr Leu Gin Glu Leu 

20 " 25 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 "5 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Glu Gin Lys Leu lie Ser Glu Glu Asp Leu Asn Met His Thr Glu Hi 

15 10 15 

His His His His His 
20 
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We Claim: 

1 . A method of detecting the presence of a target BS 124 
polynucleotide in a test sample, said method comprising: 

5 (a) contacting the test sample with at least one BS 124-specific 

polynucleotide or complement thereof; and 

(b) detecting the presence of said target BS 124 polynucleotide in the 
test sample, wherein said BS 124-specific polynucleotide has at least 50% identity 
with a polynucleotide selected from the group consisting of SEQUENCE ED NO 

10 1, SEQUENCE ID NO 2, SEQUENCE ID NO 3, SEQUENCE ID NO 4, 
SEQUENCE ID NO 5, and fragments or complements thereof. 

2 . The method of claim 1 , wherein said target BS 124 polynucleotide 
is attached to a solid phase prior to performing step (a). 

15 

3 . A method for detecting mRNA of BS 1 24 in a test sample, 
comprising: 

(a) performing reverse transcription with at least one primer in order to 
produce cDNA; 

20 (b) amplifying the cDNA obtained from step (a) using BS 124 

oligonucleotides as sense and antisense primers to obtain BS124 amplicon; and 

(c) detecting the presence of said BS 124 amplicon, wherein the BS 124 
oligonucleotides utilized in steps (a) and (b) have at least 50% identity with a 
sequence selected from the group consisting of SEQUENCE ED NO 1 , 

25 SEQUENCE ID NO 2, SEQUENCE ED NO 3, SEQUENCE ID NO 4, 
SEQUENCE ID NO 5, and fragments or complements thereof. 

4 . The method of claim 3, wherein said test sample is reacted with a 
solid phase prior to performing one of steps (a), (b), or (c). 

30 

5 . The method of claim 3, wherein said detection step comprises 
utilizing a detectable label capable of generating a measurable signal. 

6 . A method of detecting a target BS 1 24 polynucleotide in a test 
35 sample suspected of containing said target, comprising: 
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(a) contacting said test sample with at least one BS124 oligonucleotide 
as a sense primer and with at least one BS124 oligonucleotide as an anti-sense 
primer and amplifying to obtain a first stage reaction product; 

(b) contacting said first stage reaction product with at least one other 

5 BS124 oligonucleotide to obtain a second stage reaction product, with the proviso 
that the other BS 124 oligonucleotide is located 3' to the BS 124 oligonucleotides 
utilized in step (a) and is complementary to said first stage reaction product; and 

(c) detecting said second stage reaction product as an indication of the 
presence of the target BS 124 polynucleotide, wherein the BS 124 oligonucleotides 

10 utilized in steps (a) and (b) have at least 50% identity with a sequence selected 
from the group consisting of SEQUENCE ID NO 1 , SEQUENCE ID NO 2, 
SEQUENCE ED NO 3, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and 
fragments or complements thereof. 

15 7 . The method of claim 6, wherein said test sample is reacted with a 

solid phase prior to performing one of steps (a), (b), or (c). 

8 . The method of claim 6, wherein said detection step comprises 
utilizing a detectable label capable of generating a measurable signal. 

20 

9 . The method of claim 8, wherein said detectable label is reacted to a 
solid phase. 

10. A test kit useful for detecting BS 1 24 polynucleotide in a test 
25 sample, comprising a container containing at least one BS 124 polynucleotide 

having at least 50% identity with a sequence selected from the group consisting of 
SEQUENCE ID NO 1 , SEQUENCE ID NO 2, SEQUENCE ID NO 3, 
SEQUENCE ED NO 4, SEQUENCE ID NO 5, and fragments or complements 
thereof. 

30 

11. A purified polynucleotide, or a fragment thereof, derived from a 
BS124 gene, wherein said polynucleotide is capable of selectively hybridizing to 
the nucleic acid of said BS124 gene and has at least 50% identity with a 
polynucleotide selected from the group consisting of (a) SEQUENCE ID NO 1, 

35 SEQUENCE ID NO 2, SEQUENCE ED NO 4, SEQUENCE ED NO 5, and 
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complements thereof, and (b) fragments of SEQUENCE ID NO 1, SEQUENCE 
ID NO 2, and SEQUENCE ID NO 3. 

1 2. The purified polynucleotide of claim 1 1 , wherein said 
5 polynucleotide is produced by recombinant techniques. 

1 3 . The purified polynucleotide of claim 1 1 , wherein said 
polynucleotide is produced by synthetic techniques. 

10 14. The purified polynucleotide of claim 1 1 , wherein said 

polynucleotide comprises a sequence encoding at least one BS124 epitope. 

15. A recombinant expression system comprising a nucleic acid 
sequence that includes an open reading frame derived from BS124 operably linked 

15 to a control sequence compatible with a desired host, wherein said nucleic acid 
sequence has at least 50% identity with a sequence selected from the group 
consisting of SEQUENCE ID NO 1, SEQUENCE ID NO 2, SEQUENCE ID NO 
3, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and fragments or complements 
thereof. 

20 

16. A cell transfected with the recombinant expression system of claim 

15. 

17. A BS 124 polypeptide having at least 50% identity with an amino 
25 acid sequence selected from the group consisting of SEQUENCE ID NO 22, 

SEQUENCE ID NO 23, SEQUENCE ID NO 24, SEQUENCE ID NO 25, and 
fragments thereof. 

1 8 . The polypeptide of claim 1 7, wherein said polypeptide is produced 
30 by recombinant techniques. 

19. The polypeptide of claim 1 7, wherein said polypeptide is produced 
by synthetic techniques. 

35 20. An antibody which specifically binds to at least one BS 124 

epitope, wherein said BS 1 24 epitope is derived from an amino acid sequence 
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having at least 50% identity with an amino acid sequence selected from the group 
consisting of SEQUENCE ID NO 22, SEQUENCE ID NO 23, SEQUENCE ID 
NO 24, SEQUENCE ID NO 25, and fragments thereof. 

21. An assay kit for determining the presence of BS 1 24 antigen or anti- 
BS 124 antibody in a test sample, comprising a container containing a BS 124 
polypeptide having at least 50% identity with an amino acid sequence selected 
from the group consisting of SEQUENCE ID NO 22, SEQUENCE ID NO 23, 
SEQUENCE ID NO 24, SEQUENCE ID NO 25, and fragments thereof. 

22. The assay kit of claim 21, wherein said polypeptide is attached to a 
solid phase. 

23. An assay kit for determining the presence of BS 124 antigen in a 
test sample, comprising a container containing an antibody which specifically 
binds to a BS124 antigen which comprises at least one BS124 epitope. 

24. The kit of claim 23, wherein said antibody is attached to a solid 

phase. 

25 . A method for producing a polypeptide comprising at least one 
BS124 epitope, said method comprising incubating host cells that have been 
transfected with an expression vector containing a polynucleotide sequence 
encoding a polypeptide, wherein said polypeptide comprises an amino acid 
sequence having at least 50% identity with an amino acid sequence selected from 
the group consisting of SEQUENCE ID NO 22, SEQUENCE ID NO 23, 
SEQUENCE ID NO 24, SEQUENCE ID NO 25, and fragments thereof. 

26. A method for detecting BS 124 antigen in a test sample suspected of 
containing said BS124 antigen, comprising: 

(a) contacting the test sample with an antibody or fragment thereof 
which specifically binds to at least one epitope of a BS124 antigen selected from 
the group consisting of SEQUENCE ID NO 22, SEQUENCE ID NO 23, 
SEQUENCE ID NO 24, SEQUENCE ID NO 25, and fragments thereof, wherein 
said contacting is carried out for a time and under conditions sufficient for the 
formation of antibody/antigen complexes; and 
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(b) detecting the presence of said complexes as an indication of the 
presence of said BS 124 antigen. 

27. The method of claim 26, wherein said antibody is attached to a 
5 solid phase. 

28. A method for detecting the presence of antibodies specific for a 
BS124 antigen in a test sample suspected of containing such antibodies, said 
method comprising: 

10 (a) contacting the test sample with a BS 1 24 polypeptide, wherein said 

BS 124 polypeptide contains at least one BS 124 epitope derived from an amino 
acid sequence or fragment thereof having at least 50% identity with an amino acid 
sequence selected from the group consisting of SEQUENCE ID NO 22, 
SEQUENCE ED NO 23, SEQUENCE ID NO 24, SEQUENCE ID NO 25, and 

15 fragments thereof, and further wherein said contacting is carried out for a time and 
under conditions sufficient to allow antigen/antibody complexes to form; and 

(b) detecting the presence of said complexes as an indication of the 
presence of antibodies specific for a BS124 antigen. 

20 29. The method of claim 28, wherein said BS 124 polypeptide is 

attached to a solid phase. 

30. A cell transfected with a nucleic acid sequence encoding at least one 
BS 124 epitope, wherein said nucleic acid sequence is selected from the group 

25 consisting of SEQUENCE ID NO 1 , SEQUENCE ID NO 2, SEQUENCE ID NO 
3, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and fragments or complements 
thereof. 

31. A method for producing antibodies which specifically bind to 

30 BS 124 antigen, said method comprising administering to an individual an isolated 
immunogenic polypeptide or fragment thereof in an amount sufficient to elicit an 
immune response, wherein said immunogenic polypeptide comprises at least one 
BS124 epitope and has at least 50% identity with a sequence selected from the 
group consisting of SEQUENCE ED NO 22, SEQUENCE ED NO 23, 

35 SEQUENCE ED NO 24, SEQUENCE ID NO 25, and fragments thereof. 
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32. A method for producing antibodies which specifically bind to 
BS124 antigen, comprising administering to an individual a plasmid comprising a 
sequence which encodes at least one BS124 epitope derived from a polypeptide 
having an amino acid sequence selected from the group consisting of SEQUENCE 

5 ID NO 22, SEQUENCE ID NO 23, SEQUENCE ID NO 24, SEQUENCE ID NO 
25, and fragments thereof. 

33 . A composition of matter comprising a BS 1 24 polynucleotide or 
fragment thereof, wherein said polynucleotide has at least 50% identity with a 

10 polynucleotide selected from the group consisting of (a) SEQUENCE ID NO 1, 
SEQUENCE ID NO 2, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and 
complements thereof, and (b) fragments of SEQUENCE ID NO 1, SEQUENCE 
ID NO 2, and SEQUENCE ID NO 3. 

15 34. A composition of matter comprising a polypeptide containing at 

least one BS124 epitope, wherein said polypeptide has at least 50% identity with a 
sequence selected from the group consisting of SEQUENCE ID NO 22, 
SEQUENCE ID NO 23, SEQUENCE ID NO 24, SEQUENCE ID NO 25, and 
fragments thereof. 

20 

35. The test kit of claim 10 further comprising a container with tools 
useful for collection of said sample, wherein said tools are selected from the group 
consisting of lancets, absorbent paper, cloth, swabs and cups. 

25 36. The assay kit of claim 21 further comprising a container with tools 

useful for collection of said sample, wherein said tools are selected from the group 
consisting of lancets, absorbent paper, cloth, swabs and cups. 

37. The test kit of claim 23 further comprising a container with tools 
30 useful for collection of said sample, wherein said tools are selected from the group 

consisting of lancets, absorbent paper, cloth, swabs and cups. 

38. A gene, or a fragment thereof, which codes for a BS 124 protein 
which comprises an amino acid sequence having at least 50% identity to 

35 SEQUENCE ID NO 22. 
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39. A gene, or a fragment thereof, comprising DNA having at least 
50% identity with SEQUENCE ID NO 4 or SEQUENCE ID NO 5. 

40. The method of claim 1 , wherein presence of said target BS 124 
polynucleotide in said test sample is indicative of breast disease. 

4 1 . The method of claim 3, wherein presence of said amplicon is 
indicative of breast disease. 

42. The method of claim 6, wherein presence of said second stage 
reaction product is indicative of breast disease. 

43 . The method of claim 26, wherein detection of said complexes is 
indicative of breast disease. 

44. The method of claim 28, wherein detection of said complexes is 
indicative of breast disease. 
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RE AGENTS AND METHODS USEFUL FOR DETECTING 
DISEASES OF THE BREAST 



Abstract of the Disclosure 

A set of contiguous and partially overlapping cDNA sequences and 
polypeptides encoded thereby, designated as BS124 and transcribed from breast 
tissue, is described. These sequences are useful for detecting, diagnosing, 
staging, monitoring, prognosticating, preventing or treating, or determining the 
predisposition of an individual to diseases and conditions of the breast, such as 
breast cancer. Also provided are antibodies which specifically bind to a BS 124- 
encoded polypeptide or protein, and agonists or inhibitors which prevent action of 
the tissue-specific BS124 polypeptide, which molecules are useful for the 
therapeutic treatment of breast diseases, tumors or metastases. 
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Figure 1A 



>1730294 GACGCCCAGT GACCTGCCGA GGTCGGCAGC ACAGAGCTCT GGAGATGAAG 

>17302 9 4IH GACGCCCAGT GACCTGCCGA GGTCGGCAGC ACAGAGCTCT GGAGATGAAG 

Consensus GACGCCCAGT GACCTGCCGA GGTCGGCAGC ACAGAGCTCT GGAGATGAAG 

>1730294 ACCCTGTTCC TGGGTGTCAC GCTCGGCCTG GCCGCTGCCC TGTCCTTCAC 

>1730294IH ACCCTGTTCC TGGGTGTCAC GCTCGGCCTG GCCGCTGCCC TGTCCTTCAC 

Consensus ACCCTGTTCC TGGGTGTCAC GCTCGGCCTG GCCGCTGCCC TGTCCTTCAC 

>1730294 CCTGGAGGAG GAGGATATCA CAGGGACCTG GTACGTGAAG GCCATGGTGG 

>1730294IH CCTGGAGGAG GAGGATATCA CAGGGACCTG GTACGTGAAG GCCATGGTGG 
>1213903 GGATATCA CAGGGACCTG GTACGTGAAG GCCATGGTGG 

Consensus CCTGGAGGAG GAGGATATCA CAGGGACCTG GTACGTGAAG GCCATGGTGG 

>1730294 TCGATAAGGA CTTTCCGGAG GACAGGAGGC CCAGGAAGGT GTCCCCAGTG 

>1730294IH TCGATAAGGA CTTTCCGGAG GACAGGAGGC CCAGGAAGGT GTCCCCAGTG 

>1213903 TCGATAAGGA CTTTCCGGAG GACAGGAGGC CCAGGAAGGT GTCCCCAGTG 

Consensus TCGATAAGGA CTTTCCGGAG GACAGGAGGC CCAGGAAGGT GTCCCCAGTG 

>1730294 AAGGTGACAG CCCTGGGCGG TGGGAAGTTG GAAGCC 

>173 0294IH AAGGTGACAG CCCTGGGCGG TGGGAAGTTG GAAGCCACGT TCACCTTCAT 

>1213903 AAGGTGACAG CCCTGGGCGG TGGGAAGTTG GAAGCCACGT TCACCTTCAT 

Consensus AAGGTGACAG CCCTGGGCGG TGGGAAGTTG GAAGCCACGT TCACCTTCAT 

>1730294IH GAGGGAGGAT CGGTGCATCC AGAAGAAAAT CCTGATGCGG AAGACGGAGG 

>1213903 GAGGGAGGAT CGGTGCATCC AGAAGAAAAT CCTGATGCGG AAGACGGAGG 

Consensus GAGGGAGGAT CGGTGCATCC AGAAGAAAAT CCTGATGCGG AAGACGGAGG 

>1730294IH AGCCTGGCAA ATACAGCGCC TATGGGGGCA GGAAGCTCAT GTACCTGCAG 

>1213903 AGCCTGGCAA ATACAGCGCC TATGGGGGCA GGAAGCTCAT GTACCTGCAG 

Consensus AGCCTGGCAA ATACAGCGCC TATGGGGGCA GGAAGCTCAT GTACCTGCAG 

>1730294IH GAGCTGCCCA GGAGGGACCA CTACATCTTT TACTGCAAAG ACCAGCACCA 

>1213903 GAGCTGC 

<g2185139 CTGCCCA GGAGGGACCA CTACATCTTT TACTGCAAAG ACCAGCACCA 

Consensus GAGCTGCCCA GGAGGGACCA CTACATCTTT TACTGCAAAG ACCAGCACCA 

>1730294IH TGGGGGCCTG CTCCACATGG GAAAGCTTGT GGGTAGGAAT TCTGATACCA 

<g218513 9 TGGGGGCCTG CTCCACATGG GAAAGCTTGT GGGTAGGAAT TCTGATACCA 

Consensus TGGGGGCCTG CTCCACATGG GAAAGCTTGT GGGTAGGAAT TCTGATACCA 



>1730294IH ACCGGGAGGC CCTGGAAGAA TTTAAGAAAT TGGTGCAGCG CAAGGGACTC 

<g218513 9 ACCGGGAGGC CCTGGAAGAA TTTAAGAAAT TGGTGCAGCG CAAGGGACTC 

Consensus ACCGGGAGGC CCTGGAAGAA TTTAAGAAAT TGGTGCAGCG CAAGGGACTC 



>1730294IH 

<g2185139 

Consensus 



TCGGAGGAGG ACATTTTCAC GCCCCTGCAG ACGGGAAGCT GCGTTCCCGA 
TCGGAGGAGG ACATTTTCAC GCCCCTGCAG ACGGGAAGCT GCGTTCCCGA 
TCGGAGGAGG ACATTTTCAC GCCCCTGCAG ACGGGAAGCT GCGTTCCCGA 



Figure IB 
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<g2185139 

Consensus 

>1730294IH 
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Consensus 
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ACACTAGGCA 
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ACACAGAGCC 



GCCCCCGGGT 
GCCCCCGGGT 
GCCCCCGGGT 

CGGACCACCT 
CGGACCACCT 
CGGACCACCT 



CTGCACCTCC 
CTGCACCTCC 
CTGCACCTCC 

GGACCTACCC 
GGACCTACCC 
GGACCTACCC 



AGAGCCCACC 
AGAGCCCACC 
AGAGCCCACC 

TCCAGCCATG 
TCCAGCCATG 
TCCAGCCATG 



CTACCACCAG 
CTACCACCAG 
CTACCACCAG 

ACCCTTCCCT 
ACCCTTCCCT 
ACCCTTCCCT 



>1730294IH 

<g2185139 

Consensus 



GCTCCCACCC ACCTGACTCC AAATAAAGTC CTTCTCCCCC CA 
GCTCCCACCC ACCTGACTCC AAATAAAGTC CTTCTCCCCC 
GCTCCCACCC ACCTGACTCC AAATAAAGTC CTTCTCCCCC CA 
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Figure 5A 




Lane Sample 

1 Water, reagent control 

2 Transcript RNA 

3 T47D RNA (neat) 

4 T47D RNA (diluted 1:10) 

5 Human Placenta DNA 



Figure 5B 



LCx® Results in counts/sec/sec 
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at 40 cycles 
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T-47D Cell Line 
at 45 cycles 
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Figure 6B 
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Lane Tissue 

1 BPH 

2 Prostate Cancer 

3 Lung Cancer 

4 Normal Colon 

5 Colon Cancer 

6 Normal Bladder 

7 Bladder Cancer 



Lane Tissue 

8 Normal Bladder 

9 Bladder Cancer 

10 Normal Breast 

11 Normal Breast 

12 Normal Breast 

13 Breast Cancer 

14 Markers 



